Information Processing Method and Apparatus, Program and 
Recording Medium 
CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] The present application claims priority from 
International Application No. PCT/ JP01/03417 which was published 
in Japanese on November 1, 2001, the disclosure of which is 
hereby incorporated by reference herein. 
BACKGROUND OF THE INVENTION 

[0002] This invention relates to an information processing 
method and apparatus, a program, and a recording medium and, more 
particularly, to an information processing method and apparatus, 
a program and a recording medium configured for maintaining 
continuity of moving pictures in a reproducing domain. 
[0003] Recently, a variety of types of optical discs have been 
proposed as a recording medium that can be removed from a 
recording apparatus. These recordable optical discs have been 
proposed as a large capacity medium of several GBs and are 
thought to be promising as a medium for recording AV 
(audio/video) signals, such as video signals. Among the sources 
of the digital AV signals (supply sources) to be recorded on this 
recordable optical disc, there are CS digital satellite 
broadcasts and BS digital broadcasts. Additionally, ground wave 
television broadcasts of the digital system have also been 
proposed for -future use. 

[0004] The digital video signals, supplied from these sources, 
are routinely compressed under the MPEG (Motion Picture Experts 
Group) 2 system. In a recording apparatus, a recording rate 
proper to the apparatus is set. If digital video signals of the 
digital broadcast are recorded in the conventional image storage 
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mediums for domestic use, digital video signals are first decoded 
and subsequently bandwidth-limited for recording. In the case of 
the digital recording system, including, of course, the MPEG1 
video, MPEG2 video and DV systems, digital video signals are 
first decoded and subsequently re-encoded in accordance with an 
encoding system for the recording rate proper to the apparatus 
for recording subsequently. 

[0005] However, this recording system, in which the supplied 
bit stream is decoded once and subsequently bandwidth-limited and 
re-encoded prior to recording, suffers from deteriorated picture 
quality. If, in recording compressed digital video signals, the 
transmission rate of input digital signals is less than the 
recording rate for the recording and/or reproducing apparatus, 
the method of directly recording the supplied bit stream without 
decoding or re-encoding suffers from deterioration in the picture 
quality only to the least extent. However, if the transmission 
rate of the input digital signals exceeds the recording rate of 
the recording and/or reproducing apparatus, it is indeed 
necessary to re-encode the bit stream and to record the re- 
encoded bit stream so that, after decoding in the recording 
and/or reproducing apparatus, the transmission rate will be not 
higher than the upper limit of the disc recording rate. 

[0006] If the bit stream is transmitted in a variable rate 
system in which the bit rate of the input digital signal is 
increased or decreased with time, the capacity of the recording 
medium can be exploited less wastefully in the case of a disc 
recording apparatus adapted for transiently storing data in a 
buffer and for recording the data in a burst fashion than in the 
case of a tape recording system having a fixed recording rate 
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imposed by the fixed rpm of the rotary head. 

[0007] Thus, it may be predicted that, in the near future when 
digital broadcasts are to become the mainstream, an increasing 
demand will be created for a recording and/or reproducing 
apparatus in which broadcast signals are recorded as digital 
signals, without decoding or re-encoding, as in a data streamer, 
and in which a disc is used as a recording medium. 

[0008] In reproducing data recorded on a recording medium in 
the above-described recording apparatus, there is known so-called 
skip reproduction in which reproduction is continued up to a 
preset picture and a picture temporally spaced apart from the 
preset picture is reproduced next. This skip reproduction has a 
drawback that temporal continuity is lost in the reproduced 
pictures . 

SUMMARY OF THE INVENTION 

[0009] It is therefore an object of the present invention to 
enable reproduction such as to maintain continuity of moving 
pictures in the playback domain. 

[0010] The present invention provides an information 
processing apparatus including a generator operable to generate, 
when continuous reproduction from a first AV stream to a second 
AV stream is commanded, a third AV stream and address information 
as information pertinent to the third AV stream, the third AV 
stream including a preset portion of the first AV stream and a 
preset portion of the second AV stream and being reproduced when 
reproduction is switched from the first AV stream to the second 
AV stream, the address information including information on an 
address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
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AV stream, and information on an address of a source packet of 
the second AV stream at a time of switching of reproduction from 
the third AV stream to the second AV stream, and a recorder 
operable to record the third AV stream and the address 
information generated by the generator. 

[0011] Preferably, an arrival time stamp of the source packet 
of the first AV stream is continuous with an arrival time stamp 
of a source packet located at a leading end of the third AV 
stream, and an arrival time stamp of the source packet of the 
second AV stream is continuous with an arrival time stamp of a 
source packet located at a trailing end of the third AV stream. 
[0012] Preferably, a sole discontinuous point exists in the 
arrival time stamp of the source packet in the third AV stream. 
[0013] Preferably, the address is determined so that a data 
portion of an AV stream previous to a source packet specified by 
the information on the address of the source packet of the first 
AV stream is located in a continuous area of not less than a 
preset size on a recording medium. 

[0014] Preferably, the address is determined so that a data 
portion of an AV stream subsequent to a source packet specified 
by the information on the address of the source packet of the 
second AV stream is located in a continuous area of not less than 
a preset size on a recording medium. 

[0015] Preferably, the third AV stream is generated so that 
the third AV stream will be located in a continuous area of not 
less than a preset size on the recording medium. 

[0016] The present invention also provides an information 

generating method, including generating, when continuous 
reproduction from a first AV stream to a second AV stream is 



commanded, a third AV stream including a preset portion of the 
first AV stream and a preset portion of the second AV stream, the 
third AV stream being reproduced when reproduction is switched 
from the first AV stream to the second AV stream, and generating 
address information as information pertinent to the third AV 
stream, the address information including information on an 
address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
AV stream, and information on an address of a source packet of 
jy_ the second AV stream at a time of switching of reproduction from 

Ji: the third AV stream to the second AV stream. 

[0017] The present invention also provides a recording medium 
■~: having recorded thereon a computer-readable program for 

yj generating information, in which the program includes generating, 

y, when continuous reproduction from a first AV stream to a second 

fn AV stream is commanded, a third AV stream including a preset 

portion of the first AV stream and a preset portion of the second 
F 4, AV stream, the third AV stream being reproduced when reproduction 

is switched from the first AV stream to the second AV stream, and 
generating address information as information pertinent to the 
third AV stream, the address information including information on 
an address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
AV stream, and information on an address of a source packet of 
the second AV stream at a time of switching of reproduction from 
the third AV stream to the second AV stream. 

[0018] The present invention also provides a program for 
having a computer execute a program for generating information, 
in which the program includes generating, when continuous 



reproduction from a first AV stream to a second AV stream is 
commanded, a third AV stream including a preset portion of the 
first AV stream and a preset portion of the second AV stream, the 
third AV stream being reproduced when reproduction is switched 
from the first AV stream to the second AV stream, and generating 
address information as information pertinent to the third AV 
stream, the address information including information on an 
address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
AV stream, and information on an address of a source packet of 
the second AV stream at a time of switching of reproduction from 
the third AV stream to the second AV stream. 

[0019] The present invention also provides an information 
processing apparatus, including a reproducing unit operable to 
reproduce a recording medium having recorded thereon a first AV 
stream, a second AV stream, a third AV stream including a preset 
portion of the first AV stream and a preset portion of the second 
AV stream, and address information as information pertinent to 
the third AV stream, the third AV stream being reproduced when 
reproduction is switched from the first AV stream to the second 
AV stream, the address information including information on an 
address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
AV stream, and information on an address of a source packet of 
the second AV stream at a time of switching of reproduction from 
the third AV stream to the second AV stream; and a controller 
operable to control the reproducing unit for switching 
reproduction from the first AV stream read out from the first AV 
stream to the third AV stream and from the third AV stream to the 



second AV stream, based on the information pertinent to the third 
AV stream, read out from the second unit. 

[0020] The present invention also provides an information 
processing method, including reproducing a recording medium 
having recorded thereon a first AV stream, a second AV stream, a 
third AV stream including a preset portion of the first AV stream 
and a preset portion of the second AV stream, and address 
information as information pertinent to the third AV stream, the 
third AV stream being reproduced when reproduction is switched 
from the first AV stream to the second AV stream, the address 
information including information on an address of a source 
packet of the first AV stream at a time of switching of 
reproduction from the first AV stream to the third AV stream, and 
information on an address of a source packet of the second AV 
stream at a time of switching of reproduction from the third AV 
stream to the second AV stream; and controlling the reproducing 
step for switching reproduction from the first AV stream readout- 
controlled from the first readout controlling step to the third 
AV stream and from the third AV stream to the second AV stream, 
based on the information pertinent to the third AV stream, 
readout-controlled in the second readout controlling step. 
[0021] The present invention also provides a recording medium 
having recorded thereon a computer-readable program for 
processing information, in which the program includes reproducing 
a recording medium having recorded thereon a first AV stream, a 
second AV stream, a third AV stream including a preset portion of 
the first AV stream and a preset portion of the second AV stream, 
and address information as information pertinent to the third AV 
stream, the third AV stream being reproduced when reproduction is 



switched from the first AV stream to the second AV stream, the 
address information including information on an address of a 
source packet of the first AV stream at a time of switching of 
reproduction from the first AV stream to the third AV stream, and 
information on an address of a source packet of the second AV 
stream at a time of switching of reproduction from the third AV 
stream to the second AV stream; and controlling the reproducing 
step for switching reproduction from the first AV stream readout- 
controlled from the first readout controlling step to the third 

1:4, AV stream and from the third AV stream to the second AV stream, 

based on the information pertinent to the third AV stream, 

'% readout-controlled in the second readout controlling step. 

[0022] The present invention also provides a program for 

W having a computer execute a program for processing information, 

M : in which the program includes reproducing a recording medium 

ry 

having recorded thereon a first AV stream, a second AV stream, a 

U> 

third AV stream including a preset portion of the first AV stream 

Li 

and a preset portion of the second AV stream, and address 
information as information pertinent to the third AV stream, the 
third AV stream being reproduced when reproduction is switched 
from the first AV stream to the second AV stream, the address 
information including information on an address of a source 
packet of the first AV stream at a time of switching of 
reproduction from the first AV stream to the third AV stream, and 
information on an address of a source packet of the second AV 
stream at a time of switching of reproduction from the third AV 
stream to the second AV stream; and controlling the reproducing 
step for switching reproduction from the first AV stream readout- 
controlled from the first readout controlling step to the third 
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AV stream and from the third AV stream to the second AV stream, 
based on the information pertinent to the third AV stream, 
readout-controlled in the second readout controlling step. 
[0023] The present invention also provides a recording medium 
having recorded thereon address information, including, when 
continuous reproduction from a first AV stream to a second AV 
stream is commanded, a third AV stream made up of a preset 
portion of the first AV stream and a preset portion of the second 
AV stream, the third AV stream being reproduced when reproduction 
is switched from the first AV stream to the second AV stream; and 
address information as information pertinent to the third AV 
stream, the address information including information on an 
address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
AV stream, and information on an address of a source packet of 
the second AV stream at a time of switching of reproduction from 
the third AV stream to the second AV stream. 

[0024] In the information processing method, apparatus, and 
program according to the present invention, if there is a command 
to perform reproduction continuously from the first AV stream to 
the second AV stream, there is generated a third AV stream 
including a preset portion of the first AV stream and a preset 
portion of the second AV stream, and the third AV stream is 
reproduced when reproduction is switched from the first AV stream 
to the second AV stream, while there is also generated, as 
information pertinent to the third AV stream, address information 
including information on an address of a source packet of the 
first AV stream at a time of switching of reproduction from the 
first AV stream to the third AV stream, and information on the 



address of a source packet of the second AV stream at a time of 
switching of reproduction from the third AV stream to the second 
AV stream. 

[0025] In the information processing method, apparatus, and 

program according to the present invention, a first AV stream, a 
second AV stream or a third AV stream is read out from a 
recording medium; address information, including information on 
an address of a source packet of the first AV stream at a time of 
switching of reproduction from the first AV stream to the third 
AV stream and information on an address of a source packet of the 
second AV stream at a time of switching of reproduction from the 
third AV stream to the second AV stream, is read out from the 
recording medium as information pertinent to the third AV stream; 
and reproduction is switched from the first AV stream to the 
third AV stream and from the third AV stream to the second AV 
stream as reproduction proceeds, based on the read-out 
information pertinent to the third AV stream. 

[0026] Other objects, features and advantages of the present 
invention will become more apparent from reading the embodiments 
of the present invention as shown in the drawings. 
BRIEF DESCRIPTION OF THE DRAWINGS 

[0027] Fig. 1 shows a configuration of an embodiment of a 
recording and/or reproducing apparatus according to the present 
invention. 

[0028] Fig. 2 illustrates the data format of data recorded on 
a recording medium by a recording and/or reproducing apparatus 1. 
[0029] Fig. 3 illustrates Real PlayList and Virtual PlayList. 
[0030] Figs. 4A, 4B and 4C illustrate the creation of the 

Real PlayList. 



[0031] Figs. 5A, 5B and 5C illustrate deletion of the Real 
PlayList . 

[0032] Figs. 6A and 6B illustrate assemble editing. 
[0033] Fig. 7 illustrates provision of a sub path in the 
Virtual PlayList. 

[0034] Fig. 8 illustrates the changing of the playback 
sequence of the PlayList. 

[0035] Fig. 9 illustrates a mark on the PlayList and a mark 
on the Clip. 

[0036] Fig. 10 illustrates a menu thumbnail. 
[0037] Fig. 11 illustrates a mark added to the PlayList. 
[0038] Fig. 12 illustrates a mark added to the Clip. 

[0039] Fig. 13 illustrates the relation between the PlayList, 
Clip and the thumbnail file. 



[0040] 


Fig. 


14 


illustrates a directory structure. 


[0041] 


Fig. 


15 


illustrates- a syntax of infr.dvr. 


[0042] 


Fig. 


16 


shows 


a syntax of DVRVolume . 


[0043] 


Fig . 


17 


shows 


a syntax of ResumeVolume . 


[0044] 


Fig. 


18 


shows 


a syntax of UIAppInf oVolume . 


[0045] 


Fig. 


19 


shows 


a table of character set values. 


[0046] 


Fig. 


20 


shows 


a syntax of TableOf PlayList . 


[0047] 


Fig. 


21 


shows 


another syntax of TableOf PlayList . 


[0048] 


Fig . 


22 


shows 


a syntax of the MakersPrivateData . 


[0049] 


Fig . 


23 


shows 


a syntax of xxxx.rpls and yyyy.vpls. 


[0050] 


Figs . 


24A to ; 


24C illustrate the PlayList. 


[0051] 


Fig. 


25 


shows 


a syntax of PlayList. 


[0052] 


Fig. 


26 


shows 


a table of PlayLis t_type . 


[0053] 


Fig. 


27 


shows 


a syntax of UIAppInf oPlayList . 


[0054] 


Figs . 




28A 


to 28C illustrate flags in the 
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UIAppInf oPlayList syntax shown in Fig. 27. 



[0055] 


Fig. 


29 


illustrates a Playltem. 


[0056] 


Fig. 


30 


illustrates a Playltem. 


[0057] 


Fig . 


31 


illustrates a Playltem. 


[0058] 


Fig. 


32 


shows a syntax of the Playltem. 


[0059] 


Fig. 


33 


illustrates IN-time. 


[0060] 


Fig. 


34 


illustrates OUT-time. 


[0061] 


Fig. 


35 


shows a table of Connection Condition 


[0062] 


Figs . 


36A to 36D illustrate Connection_Condit; 


[0063] 


Fig . 


37 


illustrates BridgeSequencelnf o . 


[0064] 


Fig. 


38 


shows a syntax of BridgeSequencelnf o . 


[0065] 


Fig . 


39 


illustrates SubPlayltem. 


[0066] 


Fig. 


40 


shows a syntax of SubPlayltem. 


[0067] 


Fig. 


41 


shows a table of Mark_type . 


[0068] 


Fig. 


42 


shows a syntax of PlayListMark . 


[0069] 


Fig. 


43 


shows a table of Mark type. 


[0070] 


Fig. 


44 


illustrates Mark time stamp. 


[0071] 


Fig. 


45 


shows a syntax of zzzzz.clip. 


[0072] 


Fig. 


46 


shows a syntax of Cliplnfo. 


[0073] 


Fig. 


47 


shows a table of Clip_stream_type . 


[0074] 


Fig . 


48 


illustrates offset SPN. 


[0075] 


Fig. 


49 


illustrates offset_SPN. 


[0076] 


Figs . 


50A, 50B illustrate the STC domain. 


[0077] 


Fig . 


51 


illustrates STC_Info. 


[0078] 


Fig. 


52 


shows a syntax of STC Info. 


[0079] 


Fig . 


53 


illustrates Programlnfo. 


[0080] 


Fig . 


54 


shows a syntax of Programlnfo. 


[0081] 


Fig. 


55 


shows a syntax of VideoCondinglnf o . 


[0082] 


Fig. 


56 


shows a table of Video_f ormat . 
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[0083] 


Fig . 


57 


shows a table of frame rate. 


[0084] 


Fig . 


58 


shows a table of display aspect ratio. 


[0085] 


Fig. 


59 


shows a syntax of AudioCondinglnf o . 


[0086] 


Fig. 


60 


shows a table of audio_coding . 


[0087] 


Fig . 


61 


shows a table of audio component type. 


[0088] 


Fig . 


62 


shows a table of sampling frequency. 


[0089] 


Fig. 


63 


illustrates CPI . 


[0090] 


Fig. 


64 


illustrates CPI. 


[0091] 


Fig. 


65 


shows a syntax of CPI . 


[0092] 


Fig . 


66 


shows a table of CPI_type. 


[0093] 


Fig. 


67 


illustrates video EP_map. 


[0094] 


Fig. 


68 


illustrates EP map. 


[0095] 


Fig. 


69 


illustrates EP map. 


[0096] 


Fig. 


70 


shows a syntax of EP map. 


[0097] 


Fig. 


71 


shows a table of EP typevalues . 


[0098] 


Fig. 


72 


shows a syntax of EP_map f or_one_stream PID. 


[0099] 


Fig. 


73 


illustrates TU_map. 


[0100] 


Fig. 


74 


shows a syntax of TU map. 


[0101] 


Fig. 


75 


shows a syntax of ClipMark. 


[0102] 


Fig . 


76 


shows a table of Mark type. 


[0103] 


Fig . 


77 


shows a table of Mark type stamp. 


[0104] 


Fig. 


78 


shows a syntax of menu.thmb and mark.thmb. 


[0105] 


Fig. 


79 


shows the syntax of thumbnail. 


[0106] 


Fig. 


80 


shows a table of thumbnail picture format. 


[0107] 


Figs . 


81A and 81B illustrate tn_block. 


[0108] 


Fig. 


82 


illustrates a structure of a transport stream 


of DVR 


MPEG2 . 






[0109] 


Fig. 


83 


shows a recorder model of a transport stream 


of DVR 


MPEG2 . 
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[0110] 


Fig. 


84 


shows a player model of a transport stream of 


DVR MPEG2 








[0111] 


Fig . 


85 


shows the syntax of a source packet. 


[0112] 


Fig . 


86 


shows the syntax of TP extra header. 


[0113] 


Fig. 


87 


shows a table of a copy permission indicator. 


[0114] 


Fig. 


88 


illustrates seamless connection. 


[0115] 


Fig. 


89 


illustrates seamless connection. 


[0116] 


Fig. 


90 


illustrates seamless connection. 


[0117] 


Fig. 


91 


illustrates seamless connection. 


[0118] 


Fig. 


92 


illustrates seamless connection. 


[0119] 


Fig. 


93 


illustrates audio overlap. 


[0120] 


Fig. 


9< 


1 illustrates seamless connection employing 


BridgeSequence . 






[0121] 


Fig . 


95 


illustrates seamless connection not employing 


BridgeSequence . 






[0122] 


Fig. 


96 


shows a DVR STD model. 


[0123] 


Fig. 


97 


is a timing chart for decoding and display. 


[0124] 


Fig. 


98 


shows another syntax of BridgeSequencelnf o . 


[0125] 


Fig. 


99 


illustrates Bridge_Clip when two Playltems are 



seamlessly connected. 

[0126] Fig. 100 shows a syntax of a Clip Information file. 

[0127] Fig. 101 shows a syntax of a Cliplnfo of the Clip 
Information file of Fig. 100. 

[0128] Fig. 102 shows a syntax of a Sequencelnfo of the Clip 

Information file of Fig. 100. 

[0129] Figs. 103A and 103B illustrate database change in case 

stream data of the Clip AV stream file is partially erased. 
[0130] Fig. 104 is a flowchart for illustrating the 

preparation of a RealPlayList . 
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[0131] Fig. 105 is a flowchart for illustrating the 
formulation of a Virtual PlayList. 

[0132] Fig. 106 is a flowchart for illustrating the 
formulation of a bridge sequence. 

[0133] Fig. 107 is a flowchart for illustrating the 
reproduction of PlayList. 

[0134] Fig. 108 illustrates a medium. 
DETAILED DESCRIPTION 

[0135] Referring to the drawings, an embodiment of the present 
invention will be explained in detail. Fig. 1 shows a typical 
inner structure of a recording and/or reproducing apparatus 1 
embodying the present invention. First, the structure of a 
recording unit 2, configured for recording signals input from 
outside, is explained. The recording and/or reproducing 
apparatus 1 is configured for being fed with and recording analog 
or digital data. 

[0136] Analog video signals and analog audio signals are fed 
to terminals 11, 12, respectively. The video signals, input to 
the terminal 11, are output to an analysis unit 14 and to an AV 
encoder 15. The audio signals, input to the terminal 12, are 
output to the analysis unit 14 and to the AV encoder 15. The 
analysis unit 14 extracts feature points, such as scene changes, 
from the input video and audio signals. 

[0137] The AV encoder 15 encodes input video and audio signals 

to output the system information (S) , such as an encoded video 
stream (V) , an encoded audio stream (A) and AV synchronization, 
to a multiplexer 16. 

[0138] The encoded video stream is a video stream encoded, 
e.g., with the MPEG (Motion Picture Expert Group) 2 system, 
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whilst the encoded audio stream is an audio stream encoded in 
accordance with the MPEG1 system, with the encoded audio stream 
being, e.g., an audio stream encoded in, e.g., the MPEG1 system 
or an audio stream encoded in accordance with the Dolby AC 3 
(trademark) system. The multiplexer 16 multiplexes the input 
video and audio streams, based on the input system information, 
to output a multiplexed stream through a switch 17 to a 
multiplexed stream analysis unit 18 and to a source packetizer 
19. 

[0139] The multiplexed stream is, e.g., an MPEG-2 transport 
stream or an MPEG2 program stream. The source packetizer 19 
encodes the input multiplexed stream into an AV stream composed 
of source packets in accordance with an application format of a 
recording medium 100 on which to record the stream. The AV 
stream is processed in ECC (error correction and coding) unit 20 
and a modulation unit 21 with appendage of ECC codes and with 
modulation, before being output to a write unit 22, which then 
writes (records) an AV stream file based on control signals 
output by the controller 23. 

[0140] The transport stream, such as digital television 

broadcast, input from a digital interface or a digital television 
tuner, is input to a terminal 13. There are two recording 
systems for recording the transport stream input to the terminal 
13, one being a transparent recording system and the other being 
a system in which recording is preceded by re-encoding aimed to 
lower, e.g., the recording bit rate. The recording system 
command information is input from a terminal 2 4 as a user 
interface to the controller 23. 

[0141] In the transparent recording of the input transport 



stream, a transport stream, input to terminal 13, is output 
through switch 17 to the multiplexed stream analysis unit 18 and 
to the source packetizer 19. The ensuing process of recording an 
AV stream on a recording medium is the same as that of encoding 
and recording analog input audio and video signals, as described 
above, and hence is not explained here for simplicity. 
[0142] If the input transport stream is re-encoded and 
subsequently recorded, the transport stream, input to the 
terminal 13, is fed to a demultiplexer 26, which demultiplexes 
the input transport stream to extract a video stream (V) , an 
audio stream (A) and the system information (S) . 
[0143] Of the stream (information) , as extracted by the 
demultiplexer 2 6, the video stream is output to an audio decoder 
27, whilst the audio stream and the system information are output 
to the multiplexer 16. The audio decoder 27 decodes the input 
transport stream to output the encoded video stream (V) to the 
multiplexer 16. 

[0144] The audio stream and the system information, output 
from the demultiplexer 26 and input to the multiplexer 16, and 
the video stream, output by the AV encoder 15, are multiplexed, 
based on the input system information, and output to the 
multiplexed stream analysis unit 18 and to the source packetizer 
19 through switch 17, as a multiplexed stream. The ensuing 
process of recording an AV stream on a recording medium is the 
same as that of encoding and recording analog input audio and 
video signals, as described above, and hence is not explained 
here for simplicity. 

[0145] The recording and/or reproducing apparatus 1 of the 
present embodiment records a file of the AV stream on the 
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recording medium 100, while also recording the application 
database information which accounts for the file. The input 
information to the controller 23 is the feature information for 
the moving picture from the analysis unit 14, the feature 
information of the AV stream from the multiplexed stream analysis 
unit 18 and the user command information input at terminal 24. 
[0146] The feature information of the moving picture, supplied 
from the analysis unit 14, is generated by the analysis unit 14 
when the AV encoder 15 encodes video signals. The analysis unit 
14 analyzes the contents of the input video and audio signals to 
generate the information pertinent to the pictures characteristic 
of the input moving picture signals (clip mark) . This 
information is the information indicating a picture of 
characteristic clip mark points, such as program start points, 
scene change points, CM commercial start and end points, title or 
telop in input video signals, and also includes a thumbnail of 
the picture and the information pertinent to stereo/monaural 
switching points and muted portions of audio signals. 
[0147] The above picture indicating information is fed through 
controller 23 to the multiplexer 16. When multiplexing an 
encoded picture specified as clip mark by the controller 23, the 
multiplexer 16 returns the information for specifying the encoded 
picture on the AV stream to the controller 23. Specifically, 
this information is the PTS (presentation time stamp) of a 
picture or the address information on the AV stream of an encoded 
version of the picture. The controller 23 stores the sort of 
feature pictures and the information for specifying the encoded 
picture on the AV stream in association with each other. 
[0148] The feature information of the AV stream from the 
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multiplexed stream analysis unit 18 is the information pertinent 
to the encoding information of the AV stream to be recorded, and 
is recorded by the analysis unit 18. For example, the feature 
information includes the time stamp and address information of 
the I-picture in the AV stream, discontinuous point information 
of system time clocks, encoding parameters of the AV stream and 
change point information of the encoding parameters in the AV 
stream. When transparently recording the transport stream input 
from the terminal 13, the multiplexed stream analysis unit 18 
detects the picture of the aforementioned clip mark from the 
input transport stream, and generates the information for 
specifying a picture designated by the clip mark and its type. 
[0149] The user designation information from the terminal 24 
is the information specifying the playback domain, designated by 
the user, character letters for explaining the contents of the 
playback domain, or the information such as bookmarks or resuming 
points set by the user for his or her favorite scene. 
[0150] Based on the aforementioned input information, the 
controller 23 creates a database of the AV stream (Clip) , a 
database of a group (PlayList) of playback domains (Playltem) of 
the AV stream, management information of the recorded contents of 
the recording medium 100 (info.dvr) and the information on 
thumbnail pictures. Similarly to the AV stream, the application 
database information, constructed from the above information, is 
processed in the ECC unit 20 and the modulation unit 21 and input 
to the write unit 22, which then records a database file on the 
recording medium 100. 

[0151] The above-described application database information 
will be explained subsequently in detail. 
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[0152] When the AV stream file recorded on the recording 
medium 100 (files of picture data and speech data) and the 
application database information, thus recorded on the recording 
medium 100, are reproduced by a reproducing unit 3, the 
controller 23 first commands a readout unit 2 8 to read out the 
application database information from the recording medium 100. 
The readout unit 28 reads out the application database 
information from the recording medium 100, and sends the 
application database information through demodulation and error 
correction processing by a demodulating unit 29 and an ECC 
decoder 30 to the controller 23. 

[0153] Based on the application database information, the 
controller 23 outputs a list of PlayList recorded on the 
recording medium 100 to a user interface of the terminal 24. The 
user selects the PlayList desired to be reproduced from the list 
of PlayLists. The information pertinent to PlayList, specified 
to be reproduced, is input to the controller 23. The controller 
23 commands the readout unit 2 8 to read out the AV stream file 
necessary in reproducing the PlayList. In accordance with the 
command, the readout unit 28 reads out the corresponding AV 
stream from the recording medium 100 to output the read-out AV 
stream to the demodulating unit 29. The AV stream, thus input to 
the demodulating unit 29, is demodulated by preset processing and 
output through the processing by the ECC decoder 30 to a source 
depacketizer 31. 

[0154] The source depacketizer 31 converts the AV stream of 
the application format, read out from the recording medium 100 
and processed in a preset fashion, into a stream processable by 
the demultiplexer 26. The demultiplexer 26 outputs the system 
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information (S) , -such as the video stream (V), audio stream (A) 
or the AV synchronization, forming the playback domain (Playltem) 
of the AV stream specified by the controller 23, to the audio 
decoder 27, which AV decoder 27 decodes the video stream and the 
audio stream to output the playback video signal and the playback 
audio signal to associated terminals 32, 33, respectively. 
[0155] If fed from the terminal 24, as a user interface, with 

information instructing random access playback or special 
playback, the controller 23 determines the readout position of 
the AV stream from the recording medium 100, based on the 
contents of the database (Clip) of the AV stream, to command the 
readout unit 28 to read out the AV stream. If the PlayList as 
selected by the user is to be reproduced as from a preset time 
point, the controller 23 commands the readout unit 28 to read out 
data from an I-picture having a time stamp closest to the 
specified time point. 

[0156] When the user has selected a certain clip mark from 
indexing points or scene change points for the program stored in 
the ClipMark in the Clip Information, as when the user selects a 
certain picture from a list of thumbnail pictures, as 
demonstrated on a user interface, of the indexing points or scene 
change points stored in the ClipMark, the controller 23 
determines the AV stream readout position from the recording 
medium 100 to command the readout unit 2 8 to read out the AV 
stream. That is, the controller 23 commands the readout unit 28 
to read out data from an I-picture having an address closest to 
the address on the AV stream which has stored the picture 
selected by the user. The readout unit 28 reads out data from 
the specified address. The read-out data is processed by the 
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demodulating unit 29, ECC decoder 30 and by the source packetizer 
19 so as to be supplied to the demultiplexer 26 and decoded by 
the audio decoder 27 to reproduce AV data indicated by an address 
of the mark point picture. 

[0157] If the user has commanded fast forward playback, the 
controller 23 commands the readout unit 28 to sequentially read 
out I-picture data in the AV stream in succession based on the 
database (Clip) of the AV stream. 

[0158] The readout unit 28 reads out data of the AV stream 
from a specified random access point. The so read-out data is 
reproduced through processing by various components on the 
downstream side. 

[0159] The case in which the user edits the AV stream recorded 
on the recording medium 100 is now explained. If desired to 
specify a playback domain for the AV stream recorded on the 
recording medium 100, for example, if desired to create a 
playback route of reproducing a portion sung by a singer A from a 
song program A, and subsequently reproducing a portion sung by 
the same singer A from another song program B, the information 
pertinent to a beginning point (IN-point) and an end point (OUT- 
point) of the playback domain is input to the controller 23 from 
the terminal as a user interface. The controller 23 creates a 
database of the group (PlayList) of playback domains (Playltem) 
of the AV streams. 

[0160] When the user desires to erase a portion of the AV 
stream recorded on the recording medium 100, the information 
pertinent to the IN-point and the OUT-point of the erasure domain 
is input to the controller 23, which then modifies the database 
of the PlayList so as to refer to only the needed AV streams. 
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The controller 23 also commands the write unit 22 to erase an 
unneeded stream portion of the AV stream. 

[0161] The case in which the user desires to specify playback 
domains of an AV stream recorded on the recording medium to 
create a new playback route, and to interconnect the respective 
playback domains in a seamless fashion, is now explained. In 
such case, the controller 23 creates a database of a group 
(PlayList) of the playback domains (Playltem) of the AV stream 
and undertakes to partially re-encode and re-multiplex the video 
stream in the vicinity of junction points of the playback 
domains . 

[0162] The picture information at the IN-point and that at the 

OUT-point of a playback domain are input from a terminal 2 4 to a 
controller 23. The controller 23 commands the readout unit 28 to 
read out data needed to reproduce the pictures at the IN-point 
and at the OUT-point. The readout unit 2 8 reads out data from 
the recording medium 100. The data so read out is output through 
the demodulating unit 29, ECC decoder 30 and the source 
packetizer 19 to the demultiplexer 26. 

[0163] The controller 23 analyzes data input to the 
demultiplexer 2 6 to determine the re-encoding method for the 
video stream (change of picture_coding_type and assignment of the 
quantity of encoding bits for re-encoding) and the re- 
multiplexing . system to send the system to the AV encoder 15 and 
to the multiplexer 16. 

[0164] The demultiplexer 2 6 then separates the input stream 

into the video stream (V) , audio stream (A) and the system 
information (S) . The video stream may be classed into data input 
to the audio decoder 27 and data input to the multiplexer 16. 



The former is data needed for re-encoding, and is decoded by the 
audio decoder 27, with the decoded picture being then re-encoded 
by the AV encoder 15 and thereby caused to become a video stream. 
The latter data is data copied from an original stream without 
re-encoding. The audio stream and the system information are 
directly input to the multiplexer 16. 

[0165] The multiplexer 16 multiplexes an input stream, based 
on the information input from the controller 23, to output a 
multiplexed stream, which is processed by the ECC unit 20 and the 
modulation unit 21 so as to be sent to the write unit 22. The 
write unit 22 records an AV stream on the recording medium 100 
based on the control signals supplied from the controller 23. 

[0166] The application database information and the operation 
based on this information, such as playback and editing, are 
hereinafter explained. Fig. 2 shows the structure of an 
application format having two layers, that is PlayList and Clip, 
for AV stream management. The Volume Information manages all 
Clips and PlayLists in the disc. Here, one AV stream and the 
ancillary information thereof, paired together, is deemed to be 
an object, and is termed Clip. The AV stream file is termed a 
Clip AV stream file, with the ancillary information being termed 
the Clip Information file. 

[0167] One Clip AV stream file stores data corresponding to an 
MPEG-2 transport stream arranged in a structure prescribed by the 
application format. By and large, a file is treated as a byte 
string. The contents of the Clip AV stream file are expanded on 
the time axis, with entry points in the Clip (I-picture) being 
mainly specified on the time basis. When a time stamp of an 
access point to a preset Clip is given, the Clip Information file 



is useful in finding the address information at which to start 
data readout in the Clip AV stream file. 

[0168] Referring to Fig. 3, PlayList is now explained, which 

is provided for a user to select a playback domain desired to be 
viewed from the Clip and to edit the playback domain readily. 
One PlayList is a set of playback domains in the Clip. One 
playback domain in a preset Clip is termed Playltem and is 
represented by a pair of an IN-point and an OUT-point on the time 
axis. So, the PlayList is formed by a set of plural Playltems . 

[0169] The PlayList is classified into two types, one of which 
is Real PlayList and the other of which is Virtual PlayList. The 
Real PlayList co-owns stream portions of the Clip it is 
referencing. That is, the Real PlayList takes up in the disc the 
data capacity corresponding to a stream portion of the Clip it is 
referencing and, when Real PlayList is erased, the data of the 
stream portion of the Clip it is referencing is also erased. 

[0170] The Virtual PlayList is not co-owning Clip data. 

Therefore, if the Virtual PlayList is changed or erased, the 
contents of the Clip are in no way changed. 

[0171] The editing of the Real Playlist is explained. Fig. 4A 

shows creation of Real PlayList and, if the AV stream is recorded 
as a new Clip, the Real PlayList which references the entire Clip 
is a newly created operation. 

[0172] Fig. 4B shows the division of the Real PlayList, that 

is the operation of dividing the Real PlayList at a desired point 
to split the Real PlayList into two Real PlayLists . This 
division operation is performed when two programs are managed in 
one clip managed by a sole PlayList and when the user intends to 
re-register or re-record the programs as separate individual 
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programs. This operation does not lead to alteration of the Clip 
contents, that is, to division of the Clip itself. 
[0173] Fig. 4C shows the operation of combining the Real 
PlayList which is the operation of combining two Real PlayLists 
into one new Real PlayList. This combining operation is 
performed such as when the user desires to re-register two 
programs as a sole program. This operation does not lead to 
alteration of the Clip contents, that is, to combining the Clip 
itself into one. 

[0174] Fig. 5A shows deletion of the entire Real PlayList. In 
the operation of erasing the entire preset Real PlayList, the 
associated stream portion of the Clip referenced by the deleted 
Real PlayList is also deleted. 

[0175] Fig. 5B shows partial deletion of the Real PlayList. 
If a desired portion of the Real PlayList is deleted, the 
associated Playltem is altered to reference only the needed Clip 
stream portion. The corresponding stream portion of the Clip is 
deleted . 

[0176] Fig. 5C shows the minimizing of the Real PlayList. 

This is an operation of causing the Playltem associated with the 
Real PlayList to reference only the stream portion of the Clip 
needed for Virtual PlayList. The corresponding stream portion of 
the Clip not needed for the Virtual PlayList is deleted. 
[0177] If. the Real PlayList is changed by the above-described 
operation such that the stream portion of the Clip referenced by 
the Real PlayList is deleted, there is a possibility that the 
Virtual PlayList employing the deleted Clip is present such that 
problems may be produced in the Virtual PlayList due to the 
deleted Clip. 



[0178] In order to prevent this from occurring, such a message 
which runs: "If there exists the Virtual PlayList referencing the 
stream portion of the Clip the Real PlayList is referencing, and 
the Real PlayList is deleted, the Virtual PlayList itself is 
deleted - is it all right?" is displayed for the user in response 
to the user's operation of deletion by way of confirmation or 
alarming, after which the processing for deletion is executed or 
cancelled subject to the user's commands. Alternatively, the 
minimizing operation for the Real PlayList is performed in place 
of deleting the Virtual PlayList. 

[0179] The operation for the Virtual PlayList is now 
explained. If an operation is performed for the Virtual 
PlayList, the contents of the Clip are not changed. Figs. 6A 
and 6B show the assembling and editing (IN-OUT editing) . This is 
an operation of creating Playltem of the playback domain the user 
has desired to view to create Virtual PlayList. The seamless 
connection between Playltems is supported by the application 
format, as later explained. 

[0180] If there exist two Real PlayLists 1, 2 and clips 1, 2 
associated with the respective Real PlayLists, the user specifies 
a preset domain in the Real PlayList 1 (domain from INI to OUT1: 
Playltem 1) as the playback domain, and also specifies, as the 
domain to be reproduced next, a preset domain in the Real 
PlayList 2 (domain from IN2 to 0UT2 : Playltem 2) as the playback 
domain, as shown in Fig. 6A, and a sole Virtual PlayList made up 
of Playltem 1 and the Playltem2 is prepared, as shown in Fig. 6B . 
[0181] The re-editing of the Virtual PlayList is now 
explained. The re-editing may be enumerated by alteration of IN 
or OUT points in the Virtual PlayList, insertion or appendage of 
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new PI ay I terns to the Virtual PlayList and deletion of Playltems 
in the Virtual PlayList. The Virtual PlayList itself may also be 
deleted . 

[0182] Fig. 7 shows the audio dubbing (post recording) to the 

Virtual PlayList. This is an operation of registering the audio 
post recording to the Virtual PlayList as a sub path. This audio 
post recording is supported by the application software. An 
additional audio stream is added as a sub path to the AV stream 
of the main path of the Virtual PlayList. 

[0183] Common to the Real PlayList and the Virtual PlayList is 

an operation of changing (moving) the playback sequence of the 
PlayList shown in Fig. 8. This operation is an alteration of 
the playback sequence of the PlayList in the disc (volume) and is 
supported by TableOf PlayList as defined in the application 
format, as will be explained subsequently with reference to, 
e.g., Fig. 20. This operation does not lead to alteration of 
the Clip contents. 

[0184] The mark (Mark) is now explained. The mark is provided 
for specifying a highlight or characteristic time in the Clip and 
in the PlayList, as shown in Fig. 9. The mark added to the Clip 
is termed the ClipMark. The ClipMark is, e.g., a program 
indexing point or a scene change point for specifying a 
characteristic scene ascribable to contents in the AV stream. 
The ClipMark is generated by, e.g., the analysis unit 14 of Fig. 
1. When the PlayList is reproduced, the mark of the Clip 
referenced by the PlayList may be referenced and used. 
[0185] The mark appended to the PlayList is termed the 
PlayListMark (play list mark). The PlayListMark is, e.g., a 
bookmark point or a resuming point as set by the user. The mark 
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is set to the Clip and to the PlayList by adding a time stamp 
indicating the mark time point to the mark list. On the other 
hand, mark deletion is removing the time stamp of the mark from 
the mark list. Consequently, the AV stream is in no way changed 
by mark setting or by mark deletion. 

[0186] As another format of the ClipMark, a picture referenced 
by the ClipMark may be specified on the address basis in the AV 
stream. Mark setting on the Clip is by adding the address basis 
information indicating the picture of the mark point to the mark 
list. On the other hand, mark deletion is removing the address 
basis information indicating the mark point picture from the mark 
list. Consequently, the AV stream is in no way changed by mark 
setting or by mark deletion. 

[0187] A thumbnail is now explained. The thumbnail is a still 
picture added to the Volume, PlayList and Clip. There are two 
sorts of the thumbnail, one of them being a thumbnail as a 
representative picture indicating the contents . This is mainly 
used in a main picture in order for the user to select what he or 
she desired to view on acting on a cursor, not shown. Another 
sort of the thumbnail is a picture indicating a scene pointed to 
by the mark. 

[0188] The Volume and the respective PlayLists need to own 

representative pictures. The representative pictures of the 
Volume are presupposed to be used for initially demonstrating a 
still picture representing the disc contents when the disc is set 
in position in the recording and/or reproducing apparatus 1. It 
is noted that the disc means the recording medium 100 which is 
presupposed to be of a disc shape. The representative picture of 
the PlayList is presupposed to be used as a still picture for 



representing PlayList contents. 

[0189] As the representative picture of the PlayList, it may 
be contemplated to use the initial picture of the PlayList as the 
thumbnail (representative picture) . However, the leading picture 
at the playback time of 0 is not necessarily an optimum picture 
representing the contents. So, the user is allowed to set an 
optional picture as a thumbnail of the PlayList. Two sorts of 
the thumbnails, that is, the thumbnail as a representative 
picture indicating the Volume and the thumbnail as a 
representative picture indicating PlayList, are termed menu 
thumbnails. Since the menu thumbnails are demonstrated 
frequently, these thumbnails need to be read out at an elevated 
speed from the disc. Thus, it is efficient to store the totality 
of the menu thumbnails in a sole file. It is unnecessary for the 
menu thumbnails to be pictures extracted from the moving pictures 
in the Volume, but may be a picture captured from a personal 
computer or a digital still camera, as shown in Fig. 10. 

[0190] On the other hand, the Clip and the PlayList need to be 

marked with plural marks, whilst the pictures of the mark points 
need to be readily viewed in order to grasp the contents of the 
mark positions. The picture indicating such mark point is 

termed a mark thumbnail. Therefore, the picture which is the 
original of the mark thumbnail is mainly an extracted mark point 
picture rather than a picture captured from outside. 
[0191] Fig. 11 shows the relation between the mark affixed to 
the PlayList and the mark thumbnail, whilst Fig. 12 shows the 
relation between the mark affixed to the Clip and the mark 
thumbnail. In distinction from the menu thumbnail, the mark 
thumbnail is used in, e.g., a sub-menu for representing details 
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of the PlayList, while it is not requested to be read out in a 
short access time. So, whenever a thumbnail is required, the 
recording and/or reproducing apparatus 1 opens a file and reads 
out a portion of the file, while there is no problem presented 
even if file opening and reading out a portion of the file by the 
recording and/or reproducing apparatus 1 takes some time. 
[0192] For decreasing the number of files present in a volume, 
it is preferred for the totality of the mark thumbnails to be 
stored in one file. While the PlayList may own one menu 
thumbnail and plural mark thumbnails, the user is not required to 
select the Clip directly (usually, the Clip is selected through 
PlayList) , and hence there is no necessity of providing menu 
thumbnails . 

[0193] Fig. 13 shows the relation between the menu 
thumbnails, mark thumbnails, PlayList and Clips. In the menu 
thumbnail file are filed menu thumbnails provided from one 
PlayList to another. In the menu thumbnail file is contained a 
volume thumbnail representing the contents of data recorded on 
the disc. In the menu thumbnail file are filed thumbnails 
created from one PlayList to another and from one Clip to 
another. 

[0194] The CPI (Characteristic Point Information) is 
hereinafter explained. The CPI is data contained in the Clip 
information file and is used mainly for finding a data address 
in the Clip AV stream file at which to start the data readout 
when a time stamp of the access point to the Clip is afforded. 
In the present embodiment, two sorts of the CPI are used, one of 
them being EP_map and the other being TU__map . 

[0195] The EP_map is a list of entry point (EP) data extracted 
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from the elementary stream and the transport stream. This has 
the address information used to find the site of entry points in 
the AV stream at which to start the decoding. One EP data is 
made up of a presentation time stamp (PTS) and a data address in 
the AV stream of the accessing unit associated with the PTS, with 
the data address being paired to the PTS. 

[0196] The EP_map is used mainly for two purposes. First, it 

is used for finding a data address in the AV stream in the 
accessing unit referenced by the PTS in the PlayList. Second, 
the EP__map is used for fast forward playback or fast reverse 
playback. If, in recording the input AV stream by the recording 
and/or reproducing apparatus 1, the syntax of the stream can be 
analyzed, the EP_map is created and recorded on the disc. 
[0197] The TU_map has a list of time unit (TU) data which is 
derived from the arrival time point of the transport packet input 
through a digital interface. This affords the relation between 
the arrival-time-based time and the data address in the AV 
stream. When the recording and/or reproducing apparatus 1 
records an input AV stream, and the syntax of the stream cannot 
be analyzed, a TU_map is created and recorded on the disc. 
[0198] The STCInfo stores the discontinuous point information 

in the AV stream file which stores the MPEG-2 transport stream. 
[0199] When the AV stream has discontinuous points of STC, the 
same PTS values may appear in the AV stream file. Thus, if a 
time point in the AV stream is specified on the PTS basis, the 
PTS of the access point is insufficient to specify the point. 
Moreover, an index of the continuous STC domain containing the 
PTS is required. In this format, the continuous STC domain and 
its index are termed an STC-sequence and STC-sequence-id, 
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respectively. The STC-sequence information is defined by the 
STCInfo of the Clip Information file. 

[0200] The STC-sequence-id is used in an AV stream file and is 
optional in the AV stream file having the TU_map . 

[0201] The programs are each a collection of elementary 
streams and co-own a sole system time base for synchronized 
reproduction of these streams. 

[0202] It is useful for a reproducing apparatus (recording 
and/or reproducing apparatus 1 of Fig. 1) to know the contents of 
an AV stream prior to its decoding. These contents include, 
e.g., values of the PID of a transport packet transmitting an 
audio or video elementary stream or the type of the video or 
audio components, such as HDTV video or MPEG-2 AAC audio stream. 
This information is useful for creating a menu screen for 
illustrating to the user the contents of the PlayList referencing 
the AV stream. It is similarly useful for setting the initial 
state of the AV decoder and the demultiplexer of the respective 
apparatus. For this reason, the Clip Information file owns 
Programlnfo for illustrating the program contents. 
[0203] It may be an occurrence that program contents are 
changed in the AV stream file in which the MPEG-2 transport 
stream is stored. For example, the PID of the transport packet 
transmitting the video elementary stream may be changed, or the 
component type of the video stream may be changed from SDTV to 
HDTV. 

[0204] The Programlnfo stores the information on change points 

of program contents in the AV stream file. The domain of the AV 
stream file in which the program contents remain constant is 
termed program-sequence. This program-sequence is used in an AV 
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stream file having EP_map and is optional in an AV stream file 
having TU_map. 

[0205] The present embodiment defines the self-encoding stream 
format (SESF) . The SESF is used for encoding analog input 
signals and for decoding digital input signals for subsequently 
encoding the decoded signal into an MPEG-2 transport stream. 
[0206] The SESF defines an elementary stream pertinent to the 
MPEG-2 transport stream and the AV stream. When the recording 
and/or reproducing apparatus 1 encodes and records input signals 
in the SESF, an EP_map is created and recorded on the disc. 
[0207] A digital broadcast stream uses one of the following 
systems for recording on the recording medium 100: first, the 
digital broadcast stream is transcoded into an SESF stream. In 
this case, the recorded stream must conform to SESF and EP_map 
must be prepared and recorded on the disc. 

[0208] Alternatively, an elementary stream forming a digital 
broadcast stream is transcoded to a new elementary stream and re- 
multiplexed to a new transport stream conforming to the stream 
format prescribed by the organization for standardizing the 
digital broadcast stream. In this case, an EP_map must be 
created and recorded on the disc. 

[0209] For example, it is assumed that the input stream is an 
MPEG-2 transport stream conforming to the ISDB (standard 
appellation of digital BS of Japan) , with the transport stream 
containing the HDTV video stream and the MPEG AAC audio stream. 
The HDTV video stream is transcoded to an SDTV video stream, 
which SDTV video stream and the original AAC audio stream are re- 
multiplexed to TS. The SDTV stream and the transport stream both 
need to conform to the ISDB format. 



34 



[0210] Another system of recording the digital broadcast 
stream on the recording medium 100 is to make transparent 
recording of the input transport stream, that is to record the 
input transport stream unchanged, in which case the EP_map is 
formulated and recorded on the disc. 

[0211] Alternatively, the input transport stream is recorded 
transparently, that is, an input transport stream is recorded 
unchanged, in which case TU_map is created and recorded on the 
disc . 

[0212] The directory and the file are hereinafter explained. 
The recording and/or reproducing apparatus 1 is hereinafter 
described as DVR (digital video recording) . Fig. 14 shows a 
typical directory structure on the disc. The directories of the 
disc of the DVR may be enumerated by a root directory including 
"DVR" directory, and the "DVR" directory, including "PLAYLIST" 
directory, "CLIPINF" directory, "M2TS" directory and "DATA" 
directory, as shown in Fig. 14. Although directories other than 
these may be created below the root directory, these are 
discounted in the application format of the present embodiment. 
[0213] Below the "DATA" directory, there are stored all files 
and directories prescribed by the DVR application format. The 
"DVR" directory includes four directories. Below the "PLAYLIST" 
directory are placed Real PlayList and Virtual PlayList database 
files. The flatter directory may exist in a state devoid of 
PlayList . 

[0214] Below "CLIPINF" is placed a Clip database. This 
directory, too, may exist in a state devoid of AV stream files. 
In the "DATA" directory, there are stored files of data 
broadcasts, such as digital TV broadcasts. 
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[0215] The " DVR" directory stores the following files: an 
"info.dvr" is created below the DVR directory to store the 
comprehensive information of an application layer. Below the DVR 
directory, there must be a sole info.dvr. The filename is 
assumed to be fixed to info.dvr. The "menu.thmb" stores the 
information pertinent to the menu thumbnails. Below the DVR 
directory, there must be 0 or 1 menu thumbnail. The filename is 
assumed to be fixed to "menu.thmb". If there is no menu 
thumbnail, this file may not exist. 

[0216] The "mark.thmb" file stores the information pertinent 
to the mark thumbnail picture. Below the DVR directory, there 
must be 0 or 1 mark thumbnail. The filename is assumed to be 
fixed to "mark.thmb". If there is no mark thumbnail, this file 
may not exist. 

[0217] The " PLAYLIST" directory stores two sorts of the 
PlayList files which are Real PlayList and Virtual PlayList. An 
"xxxxx.rpls" file stores the information pertinent to one Real 
PlayList. One file is created for each Real PlayList. The 
filename is "xxxxx.rpls", where "xxxxx" denotes five numerical 
figures from 0 to 9 . A file extender must be "rpls". 
[0218] The "yyyyy.vpls" stores the information pertinent to 

one Virtual PlayList. One file with a filename "yyyyy.vpls" is 
created from one Virtual PlayList to another, where "yyyyy" 
denotes five numerical figures from 0 to 9 . A file extender must 
be "vpls". 

[0219] The "CLIPINF" directory stores one file in association 

with each AV stream file. The "zzzzz.clpi" is a Clip Information 
file corresponding to one AV stream file (Clip AV stream file or 
Bridge-Clip stream file) . The filename is "zzzzz.clpi", where 
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"zzzzz" denotes five numerical figures from 0 to 9. A file 
extender must be "dpi". 

[0220] The "M2TS " directory stores an AV stream file. The 
"zzzzz. m2ts" file is an AV stream file handled by the DVR system. 
This is a Clip AV stream file or a Bridge-Clip AV stream file. 
The filename is "zzzzz .m2ts" , where "zzzzz" denotes five 
numerical figures from 0 to 9 . A file extender must be "m2ts". 
[0221] The "DATA" directory stores data transmitted from data 

broadcasting. This data may, for example, be XML or MPEG files. 
[0222] The syntax and the semantics of each directory (file) 
are now explained. Fig. 15 shows the syntax of the "info.dvr" 
file. The "info.dvr" file is made up of three objects, that is 
DVRVolume ( ) , TableOf PlayLists ( ) and MakersPrivateData () . 
[0223] The syntax of info.dvr shown in Fig. 15 is explained. 
The TableOfPlayLists_Start_address indicates the leading address 
of the TableOf PlayLists ( ) in terms of the relative number of 
bytes from the leading byte of the "info.dvr" file. The 
relative number of bytes is counted beginning from 0. 
[0224] The MakersPrivateData_Start_address indicates the 
leading address of the MakersPrivateData () , in terms of the 
relative number of bytes as from the leading byte of the 
"info.dvr" file. The relative number of bytes is counted from 0. 
The padding_word is inserted in association with the syntax of 
"info.dvr". . Nl and N2 are optional positive integers. Each 
padding word may assume an optional value. 

[0225] The DVRVolume ( ) stores the information stating the 
contents of the volume (disc). The syntax of the DVRVolume (), 
shown in Fig. 16, is now explained. The version_number indicates 
four character letters indicting the version numbers of the 



DVRVolume <). The version_number is encoded to "0045" in 
association with IS0646. 

[0226] Length is denoted by 32-bit unsigned integers 
indicating the number of bytes from directly after the length 
field to the trailing end of DVRVolume ( ) . 

[0227] The ResumeVolume ( ) memorizes the filename of the Real 
PlayList or the Virtual PlayList reproduced last in the Volume. 
However, the playback position when the user has interrupted 
playback of the Real PlayList or the Virtual PlayList is stored 
in the resume-mark defined in the PlayLis tMark ( ) (see Figs. 42 
and 43) . 

[0228] The syntax of the ResumeVolume () , shown in Fig. 17, is 

explained. The valid_flag indicates that the 

-resume_PlayList_name field is valid or invalid when this 1-bit 
flag is set to 1 or 0, respectively. The 10-byte field of 
resume_PlayList_name indicates the filename of the Real PlayList 
or the Virtual PlayList to be resumed. 

[0229] The UIAppInf oVolume ( ) in the syntax of the DVRVolume (), 
shown in Fig. 16, stores parameters of the user interface 
application concerning the Volume. Fig. 18 shows the syntax of 
the UIAppInf oVolume () , the semantics of which are now explained. 
The 8-bit field of character_set indicates the encoding method 
for character letters encoded in the Volume_name field. The 
encoding method corresponds to the values shown in Fig. 19. 
[0230] The 8-bit field of the name_length indicates the byte 
length of the Volume name indicated in the Volume_name field. 
The Volume_name field indicates the appellation of the Volume. 
The number of bytes of the number of the name_length counted from 
left of the field is the number of valid characters and indicates 
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the volume appellation. The values next following these valid 
character letters may be any values. 

[0231] The Volume_protect_f lag is a flag indicating whether or 
not the contents in the Volume can be shown to the user without 
limitations. If this flag is set to 1, the contents of the 
Volume are allowed to be presented (reproduced) to the user only 
in case the user has succeeded in correctly inputting the PIN 
number (password) . If this flag is set to 0, the contents of the 
Volume are allowed to be presented to the user even in case the 
PIN number is not input by the user. 

[0232] If, when the user has inserted a disc into a player, 
this flag has been set to 0, or the flag is set to 1 but the user 
has succeeded in correctly inputting the PIN number, the 
recording and/or reproducing apparatus 1 demonstrates a list of 
the PlayList in the disc. The limitations on reproduction of the 
respective PlayLists are irrelevant to the Volume_protect_f lag 
and are indicated by playback_control_f lag defined in the 
UIAppInf oVolume ( ) . 

[0233] The PIN is made up of four numerical figures of from 0 
to 9, each of which is coded in accordance with ISO/IEC 64 6. The 
ref_thumbnail_index field indicates the information of a 
thumbnail picture added to the Volume. If the 

ref_thumbnail_index field is of a value other than OxFFFF, a 
thumbnail picture is added to the Volume. The thumbnail picture 
is stored in a menu . thmb file. The picture is referenced using 
the value of the ref_thumbnail_index in the menu. thmb file. If 
the ref_thumbnail_index field is OxFFFF, it indicates that a 
thumbnail picture has been added to the Volume. 

[0234] The TableOf PlayList ( ) in the info.dvr syntax shown in 
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Fig. 15 is explained. The TableOf PlayList ( ) stores the filename 
of the PlayList (Real PlayList and Virtual PlayList) . All 
PlayList files recorded in the Volume are contained in the 
TableOf PlayList () , which TableOf PlayList ( ) indicates the playback 
sequence of the default of the PlayList in the Volume. 
[0235] Fig. 20 shows the syntax of the TableOf PlayList () , 
which is now explained. The version_number of the 

TableOf PlayList ( ) indicates four character letters indicating the 
version numbers of the TableOf PlayLists . The vers ion_n umber must 
be encoded to "0045" in accordance with ISO 646. 
[0236] Length is an unsigned 32-bit integer indicating the 
number of bytes of the TableOf PlayList ( ) from directly after the 
length field to the trailing end of the TableOf PlayList () . The 
16-bit field of the number_of_PlayLists indicates the number of 
loops of the for-loop inclusive of the PlayList_f ile_name . This 
numerical figure must be equal to the number of PlayLists 
recorded in the Volume. The 10-byte numerical figure of the 
PlayList_f ile_name indicates the filename of the PlayLists. 
[0237] Fig. 21 shows another configuration of the syntax of 
the TableOf PlayList () . The syntax shown in Fig. 21 is comprised 
of the syntax shown in Fig. 20 in which is contained the 
UIAppInf oPlayList . By such structure including the 

UIAppInf oPlayList , it becomes possible to create a menu picture 
simply on reading out the TableOf PlayList s . The following 
explanation is premised on the use of the syntax shown in Fig. 
20 . 

[0238] The MakersPrivateData in the info.dvr shown in Fig. 15 
is explained. The MakersPrivateData is provided to permit the 
maker of the recording and/or reproducing apparatus 1 to insert 
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private data of the maker in the MakersPrivateData ( ) for special 
applications of different companies. The private data of each 
maker has standardized maker_ID for identifying the maker who has 
defined it. The MakersPrivateData ( ) may contain one or more 
maker_ID . 

[0239] If a preset maker intends to insert private data, and 
the private data of a different maker is already contained in the 
MakersPrivateData () , the new private data' is added to the 
MakersPrivateData () without erasing the pre-existing old private 
data. Thus, in the present embodiment, private data of plural 
makers can be contained in one MakersPrivateData () . 
[0240] Fig. 22 shows the syntax of the MakersPrivateData, 

which is now explained. The version_number of the 

TableOfPlayList () indicates four character letters indicating the 
version numbers of the TableOf PlayLists . The version_number must 
be encoded to "0045" in accordance with ISO 646. Length is an 
unsigned 32-bit integer indicating the number of bytes of the 
TableOfPlayList () from directly after the length field to the 
trailing end of the MakersPrivateData () . 

[0241] The mpd_blocks_s tart_address indicates the leading end 
address of the first mpd_block() in terms of the relative number 
of bytes from the leading byte of the MakersPrivateData () . The 
number_of_maker_entries is the 16-bit codeless integer affording 
the number of entries of the maker private data included in the 
MakersPrivateData () . There must not be present two or more maker 
private data having the same maker_ID values in the 
MakersPrivateData ( ) . 

[0242] The mpd_blocks_size is a 16-bit unsigned integer 

affording one mpd_block size in terms of 1024 bytes as a unit. 



For example, if mpd_block_size = 1, it indicates that the size of 
one mpd_block is 1024 bytes. The number_of _mpd_blocks is a 16- 
bit unsigned integer affording the number of mpd_blocks contained 
in the MakersPrivateData ( ) . The maker_ID is the 16-bit unsigned 
integer indicating the model number code of the DVR system which 
has created the maker private data. The value encoded to the 
maker_ID is specified by the licensor. 

[0243] The maker_model_code is a 16-bit unsigned integer 
indicating the model number code of the DVR system which has 
created the maker private data. The value encoded to the 
maker_model_code is set by the maker who has received the license 
of the format. The start_mpd_block_number is a 16-bit unsigned 
integer indicating the number of the mpd_block number at which 
begins the maker private data. The leading end of the maker 
private data must be aligned with the leading end of the 
mpd_block. The start_mpd_block_n umber corresponds to a variable 
j in the for-loop of the mpd_block. 

[0244] The mpd_length is a 32-bit unsigned integer indicating 

the size of the maker private data. The mpd_block is an area in 
which is stored maker private data. All of the mpd_blocks in the 
MakersPrivateData { ) must be of the same size. 

[0245] The real PlayList file and the Virtual PlayList file, 
in other words, xxxxx.rpls and yyyyy.vpls, are explained. Fig. 
23 shows the ^syntax of xxxxx.rpls (Real PlayList) and yyyyy.vpls 
(Virtual PlayList), which are of the same syntax structure. Each 
of the xxxxx.rpls and yyyyy.vpls is made up of three objects, 
that is PlayList (), PlayListMar'k ( ) and MakersPrivateData () . 
[0246] The PlayListMark_Start_address indicates the leading 
address of the PlayListMark ( ) , in terms of the relative number of 
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bytes from the leading end of the PlayList file as a unit. The 
relative number of bytes is counted from zero. 

[0247] The MakersPrivateData_Start_address indicates the 

leading address of the MakersPrivateData ( ) , in terms of the 
relative number of bytes from the leading end of the PlayList 
file as a unit. The relative number of bytes is counted from 
zero . 

[0248] The padding_word (padding word) is inserted in 
accordance with the syntax of the PlayList file, with Nl and N2 
being optional positive integers. Each padding word may assume 
an optional value. 

[0249] PlayList will be further explained in the following, 
although it has been explained briefly. A playback domain in all 
Clips except Bridge-Clip must be referred to by all PlayLists in 
the disc. Also, two or more Real PlayLists must not overlap the 
playback domains shown by their Playltems in the same Clip. 
[0250] Reference is made to Figs. 24A, 24B and 24C. For all 

Clips, there exist corresponding Real PlayLists, as shown in Fig. 
24A. This rule is observed even after the editing operation has 
come to a close, as shown in Fig. 24B. Therefore, all Clips must 
be viewed by referencing one of Real PlayLists. 

[0251] Referring to Fig. 24C, the playback domain of the 

Virtual PlayList must be contained in the playback domain and in 
the Bridge-Clip playback domain. There must not be present in 
the disc Bridge-Clip not referenced by any Virtual PlayList. 
[0252] The Real PlayList, containing the list of the Playltem, 

must not contain SubPlayltem. The Virtual PlayList contains the 
Playltem list and, if the CPI_type contained in the PlayList () is 
the EP_map type and the PlayList_type is 0 (PlayList containing 



video and audio) , the Virtual PlayList may contain one 
SubPlayltem. In the PlayList ( ) in the present embodiment, the 
SubPlayltem is used only for audio post recording. The number of 
the SubPlayltems owned by one Virtual PlayList must be 0 or 1 . 
[0253] The PlayList is hereinafter explained. Fig. 25 shows 
the PlayList syntax. The version_n umber indicates four character 
letters indicating the version numbers of the PlayList (). The 
version_number is encoded to "0045" in association with ISO 646. 
Length is a 32-bit unsigned integer indicating the total number 
of byte of the PlayList { ) as from directly after the length field 
to the trailing end of the PlayList (). The PlayList_type, one 
example of which is shown in Fig. 26, is an 8-bit field 
indicating the PlayList type. 

[0254] The CPI_type is a one-bit flag indicating the value of 
the CPI_type of the Clip referenced by the Playltem() and by the 
SubPlayltem ( ) . The CPI_types defined in the CPIs of all Clips 
referenced by one PlayList must be of the same values. The 
number_of_PlayItems is a 16-bit field indicating the number of 
Playltems present in the PlayList. 

[0255] The Playltem_id corresponding to the preset PlayltemO 
is defined by the sequence in which the PlayltemO appears in the 
for-loop containing the PlayltemO • The Playltem_id begins with 
0. The number_of_SubPlayItems is a 16-bit field indicating the 
number of SubPlayltems in the PlayList. This value is 0 or 1. An 
additional audio stream path (audio stream path) is a sort of a 
sub path. 

[0256] The UIAppInf oPlayLis t of the PlayList syntax shown in 
Fig. 25 is explained. The UIAppInf oPlayList stores parameters of 
the user interface application concerning the PlayList. Fig. 27 



shows the syntax of the UIAppInf oPlayList . The character_set is 
an 8-bit field indicating the method for encoding character 
letters encoded in the PlayList name field. The encoding method 
corresponds to the values conforming to the table shown in Fig. 
19 . 

[0257] The name_length is an 8-bit field indicating the byte 
length of the PlayList name indicated in the PlayList_name field. 
The PlayList_name field shows the appellation of the PlayList. 
The number of bytes of the number of the name_length counted from 
left of the field is the number of valid characters and indicates 
the PlayList appellation. The values next following these valid 
character letters may be any values. 

[0258] The record_time_and_date is a 56-bit field storing the 
date and time on which the PlayList was recorded. This field is 
14 numerical figures for year/month/day/hour/minute/second 
encoded in binary coded decimal (BCD) . For example, 
2001/12/23:01:02:03 is encoded to "0x20011223010203". 
[0259] The duration is a 24-bit field indicating the total 
replay time of the PlayList in terms of hour/minute/second as a 
unit. This field is six numerical figures encoded in binary 
coded decimal (BCD). For example, 01:45:30 is encoded to 
"0x014530" . 

[0260] The valid_period is a 32-bit field indicating the valid 

time periods .of the PlayList. This field is 8 numerical figures 
encoded in 4-bit binary coded decimal (BCD) . The valid_period is 
used in the recording and/or reproducing apparatus 1, e.g., when 
the PlayList, for which the valid period has lapsed, is to be 
automatically erased. For example, 2001/05/07 is encoded to 
"0x20010507" . 
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[0261] The maker_ID is a 16-bit unsigned integer indicating 
the maker of the DVR player (recording and/or reproducing 
apparatus 1) which has been the last to update its PlayList. The 
value encoded to maker_ID is assigned to the licensor of the DVR 
format. The maker_code is a 16-bit unsigned integer indicating 
the model number of the DVR player which has been the last to 
update the PlayList. The value encoded to the maker_code is 
determined by the maker who has received the license of the DVR 
format . 

[0262] If the flag of the playback_control_f lag is set to 1, 
its PlayList is reproduced only when the user has successfully 
entered the PIN number. If this flag is set to 0, the user may 
view the PlayList without the necessity of inputting the PIN 
number . 

[0263] If the write_protect_f lag is set to 1, the contents of 
the PlayList are not erased nor changed except the 
write_protect_flag. If this flag is set to 0, the user is free 
to erase or change the PlayList. If this flag is set to 1, the 
recording and/or reproducing apparatus 1 demonstrates a message 
requesting re-confirmation by the user before the user proceeds 
to erase, edit or overwrite the PlayList. 

[0264] The Real PlayList, in which the write_protect_f lag is 
set to 0, may exist, the Virtual PlayList, referencing the Clip 
of the Real PlayList may exist, and the write_protect_f lag of the 
Virtual PlayList may be set to 1 . If the user desires to erase 
the Real PlayList, the recording and/or reproducing apparatus 1 
issues an alarm to the user as to the presence of the 
aforementioned Virtual PlayList or "minimizes" the Real PlayList 
before erasing the Real PlayList. 
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[0265] If the is_played_f lag is set to 1, as shown in Fig. 
28B, it indicates that the PlayList was reproduced at least once 
since it was recorded, whereas, if it is set to 0, it indicates 
that the PlayList was not reproduced even once since it was 
recorded . 

[0266] Archive is a two-bit field indicating whether the 
PlayList is an original or a copy, as shown in Fig. 28C. The 
field of ref_thumbnail_index indicates the information of a 
thumbnail picture representative of the PlayList. If the 
ref_thumbnail_index field is of a value other than OxFFFF, a 
thumbnail picture representative of the PlayList is added in the 
PlayList, with the PlayList being stored in the menu.thmb file. 
The picture is referenced using the value of ref_thumbnail_index 
in the menu.thmb file. If the ref_thumbnail_index field is 
OxFFFF, no thumbnail picture representative of the PlayList is 
added in the PlayList. 

[0267] The Playltem is hereinafter explained. One PlayltemO 
basically contains the following data: Clip_Inf ormation_f ile_name 
for specifying the filename of the Clip, IN-time and OUT-time, 
paired together to specify the playback domain of Clip, 
STC_sequence_id referenced by IN-time and OUT-time in case the 
CPI_type defined in PlayList () is EP_map type, and 
Connection_Condition indicating the connection condition of 
previous Playltem and current Playltem. 

[0268] If PlayList is made up of two or more Playltems, these 

Playltems are arrayed in a row, without temporal gap or overlap, 
on the global time axis of the PlayList. If CPI_type defined in 
the PlayList is EP_map type and the current PlayList does not 
have the BridgeSequence { ) , the IN-time and OUT-time pair must 
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indicate the same time on the STC continuous domain as that 
specified by the STC_sequence_id . Such instance is shown in Fig. 
29. 

[0269] Fig. 30 shows such a case in which the CPI_type defined 

by PlayListO and, if the current Playltem has the 
BridgeSequence ( ) , the rules as now explained are applied. The 
IN_time of the Playltem previous to the current Playltem, shown 
as IN_timel, indicates the time in Bridge-Clip specified in the 
BridgeSequencelnfo () of the current Playltem. This 0UT_time must 
obey the encoding limitations which will be explained 
subsequently. 

[0270] The IN_time of the current Playltem, shown as IN_time2, 
indicates the time in Bridge-Clip specified in the 
BridgeSequencelnfo () of the current Playltem. This IN_time also 
must obey the encoding limitations as later explained. The 
OUT_time of Playltem of the current Playltem, shown as 0UT_time2, 
indicates the time on the STC continuous domain specified by 
STC_sequenbce_id of the current Playltem. 

[0271] if the CPI_type of PlayListO is TU_map type, the 
IN_time and OUT_time of Playltem, paired together, indicate the 
time on the same Clip AV stream, as shown in Fig. 31. 

[0272] The Playltem syntax is as shown in Fig. 32. As to the 
syntax of the Playltem, the field of the 

Clip_inf ormation_f ile_name indicates the filename of the Clip 
Information. The Clip_stream_type defined by the Cliplnfo ( ) of 
this Clip Information file must indicate the Clip AV stream. 

[0273] The STC_sequence_id is an 8-bit field and indicates the 
STC_sequence_id of the continuous STC domain referenced by the 
Playltem. If the CPI_type specified in the PlayListO is TU_map 
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type, this 8-bit field has no meaning and is set to 0. IN_time 
is a 32-bit field and used to store the playback start time of 
Playltem. The semantics of IN_time differs with CPI_type defined 
in the PlayList ( ) , as shown in Fig. 33. 

[0274] OUT_time is a 32-bit field and is used to store the 
playback end time of Playltem. The semantics of OUT_time differs 
with CPI_type defined in the PlayList (), as shown in Fig. 34. 
[0275] Connection_condition is a 2-bit field indicating the 
connection condition between the previous Playltem and the 
current Playltem, as shown in Fig. 35. Figs. 36A to 36D 
illustrate various states of Connection_condition shown in Fig. 
35. 

[0276] BridgeSequencelnfo is explained with reference to Fig. 
37. This BridgeSequencelnfo is the ancillary information of the 
current Playltem and includes the following information. That is, 
BridgeSequencelnfo includes Bridge_Clip_Inf ormation_f ile_name for 
specifying the Bridge_Clip AV file and a 

Bridge_Clip_Information_file_name specifying the corresponding 
Clip Information file (Fig. 45). 

[0277] It is also an address of a source packet on the Clip AV 
stream referenced by the previous Playltem. Next to this source 
packet is connected the first source packet of the Bridge-Clip AV 
stream. This address is termed the RSPN_exit_f rom_previous_Clip . 
It is also an address of the source packet on the Clip AV stream 
referenced by the current Playltem. Ahead of this source packet 
is connected the last source packet of the Bridge_clip AV stream 
file. This address is termed RSPN_enter_to_current_Clip . 
[0278] In Fig. 37, RSPN_arrival_time_discontinuity indicates 
an address of a source packet in the Bridge_Clip AV stream where 



there is a discontinuous point in the arrival time base. This 
address is defined in the ClipInfoO (Fig. 46). 

[0279] Fig. 38 shows the syntax of the BridgeSequencelnf o . 
The field of Bridge_Clip_Inf ormat ion_f ile_name indicates the 
filename of the Clip Information file corresponding to the 
Bridge_Clip_Information_file . The Clip_stream_type defined in 
ClipInfoO of this Clip information file must indicate 
'Bridge_Clip AV stream'. 

[0280] The 32-bit field of the RSPN_exit_f rom_previous_Clip is 
a relative address of a source packet on the Clip AV stream 
referenced by the previous Playltem. Next to this source packet 
is connected the first source packet of the Bridge_Clip AV stream 
file. The RSPN_exit_f rom_previous_Clip has a size based on the 
source packet number as a unit, and is counted with the value of 
the offset_SPN defined in the ClipInfoO from the first source 
packet of the Clip AV stream file referenced by the previous 
Playltem. 

[0281] The 32-bit field of RSPN_enter_to_curent_Clip is the 
relative address of the source packet on the Clip AV stream 
referenced by the current Playltem. Ahead of this source packet 
is connected the last source packet of the Bridge_Clip_AV stream 
file. The RSPN_enter_to_curent_Clip has a size that is based on 
the source packet number as a unit. The 
RSPN_enter_tq_curent_Clip is counted with the value of the 
offset_SPN, defined in the ClipInfoO from the first source 
packet of the Clip AV stream file referenced by the current 
Playltem, as an initial value. 

[0282] The SubPlayltem is explained with reference to Fig. 39. 
The use of SubPlayltem ( ) is permitted only if the CPI_type of the 
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PlayListO is the EP_map type. In the present embodiment, 
SubPlayltem is used only for audio post recording. The 
SubPlayItem( ) includes the following data. First, it includes 
Clip_Information_file_name for specifying the Clip referenced by 
the sub path in the PlayList. 

[0283] It also includes SubPath_IN_time and SubPath_OUT_time 
for specifying the sub path playback domain in the Clip. 
Additionally, it includes sync_PlayI tem_id and 

start_PTS_of_PlayItem for specifying the time of starting the sub 
path reproduction on the main path time axis. The Clip AV 
stream, referenced by the sub path, must not contain STC 
discontinuous points (discontinuous points of the system time 
base) . The clocks of audio samples of the Clip used in the sub 
path are locked to the clocks of the audio samples of the main 
path . 

[0284] Fig. 40 shows the syntax of the SubPlayltem. The field 
of the Clip_Inf ormation_f ile_name indicates the filename of the 
Clip Information file and is used by a sub path in the PlayList. 
The Clip_stream_type defined in this ClipInfoO must indicate the 
Clip AV stream. 

[0285] An 8-bit field of sync_PlayItem_id indicates the sub 
path type. Here, only '0x00' is set, as shown in Fig. 41, while 
other values are reserved for future use. 

[0286] The 8-bit field of sync_PlayI tem_id indicates the 
Playltem_id of the Playltem containing the time of playback start 
of the sub path on the time axis of the main path. The value of 
Playltem_id corresponding to the preset Playltem is defined in 
the PlayListO (Fig. 25). 

[0287] A 32-bit field of sync_start_PTS_of_PlayItem denotes 
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the time of playback start of the sub path on the time axis of 
the main path, and denotes the upper 32 bits of the PTS 
(presentation time stamp) on the Playltem referenced by the 
sync_PlayItem_id. The upper 32 bit field of the SubPath_IN_time 
stores the playback start time of the sub path. SubPath_IN_time 
denotes upper 32 bits of the PTS of 33 bits corresponding to the 
first presentation unit in the sub path. 

[0288] The upper 32 bit field of subPath_OUT_time stores the 
playback end time of the sub path. SubPath_OUT_time indicates 
upper 32 bits of the value of the Presentation_end_TS calculated 
by the following equation: 

Presentation_end_TS = PTS_OUT + AU_duration 
where PTS_out is the PTS of the 33 bit length corresponding to 
the last presentation unit of the SubPath and AU_duration is the 
90 kHz based display period of the last presentation unit of the 
SubPath. 

[0289] Next, PlayLis tMar k ( ) in the syntax of xxxxx.rpls and 
yyyyy.vpls shown in Fig. 23 is explained. The mark information 
pertinent to the PlayLis t is stored in this PlayListMark . Turning 
to the syntax of the PlayListMark, shown in Fig. 42, 
version_number is four character letters indicating the version 
number of this PlayListMark (} . The version_number must be 
encoded to "0045" in accordance with ISO 646. 

[0290] Length is an unsigned 32-bit integer indicating the 

number of bytes of PlayListMark ( ) from directly after the length 
field to the trailing end of the PlayListMark { ) . The 
number_of_PlayListMarks is a 16-bit unsigned integer indicating 
the number of marks stored in the PlayListMark. The 
number_of_PlayListMarks may be zero. The mark_type is an 8-bit 
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field indicating the mark type and is encoded in the table shown 
in Fig. 43. 

[0291] A 32-bit field of mark_time_stamp stores a time stamp 
indicating the point specified by the mark. The semantics of the 
mark_time_stamp differ with CPI_type defined in the PlayListO, 
as shown in Fig. 44. The Playltem_id is an 8-bit field 
specifying the Playltem where the mark is put. The values of 
Playltem_id corresponding to a preset Playltem are defined in the 
PlayListO (see Fig. 25). 
y. [0292] An 8-bit field of character_set shows the encoding 

: method of character letters encoded in the mark_name field. The 

"=ijf encoding method corresponds to values shown in Fig. 19. The 8- 

bit field of name_length indicates the byte length of the mark 
yj name shown in the mark_name field. The mark_name field denotes 

|=v the mark name indicated in the mark_name field. The number of 

bytes corresponding to the number of name_lengths from the left 
"Z of this field is the effective character letters and denotes the 

^ mark name. In the mark_name field, the value next following 

these effective character letters may be arbitrary. 
[0293] The field of the ref _thumbnail_index denotes the 
information of the thumbnail picture added to the mark. If the 
field of the ref _thumbnail_index is not OxFFFF, a thumbnail 
picture is added to its mark, with the thumbnail picture being 
stored in the mark.thmb file. This picture is referenced in the 
mark.thmb file, using the value of ref_thumbnail_index, as 
explained subsequently. If the ref_thumbnail_index field is 
OxFFFF, it indicates that no thumbnail picture is added to the 
mark . 

[0294] The Clip Information file is now explained. The 
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zzzzz.clpi (Clip Information file) is made up of six objects, as 
shown in Fig. 45. These are ClipInfoO, STC_Info(), Programf) , 
CPI(), ClipMarkO and MakersPrivateData ( ) . For the AV stream 
(Clip AV stream or Bridge-Clip AV stream) and the corresponding 
Clip Information file, the same string of numerals "zzzzz" is 
used . 

[0295] The syntax of zzzzz.clpi (Clip Information file), 

shown in Fig. 45, is now explained. The Cliplnf o_Start_address 

indicates the leading end address of ClipInfoO with the relative 

, number of bytes from the leading end byte of the zzzzz.clpi file 

.; as a unit. The relative number of bytes is counted from zero. 

W [0296] The STC Info Start address indicates the leading end 

« - - _ 

address of STC_Info with the relative number of bytes from the 

O 

yj leading end byte of the zzzzz.clpi file as a unit. The 

y: ProgramInfo_Start_address indicates the leading end address of 

Programlnfo () with the relative number of bytes from the leading 
end byte of the zzzzz.clpi file as a unit. The relative number 

O 

M= of bytes is counted from 0. The CPI_Start_address indicates the 

leading end address of CPI() with the relative number of bytes 
from the leading end byte of the zzzzz.clpi file as a unit. The 
relative number of bytes is counted from zero. 

[0297] The ClipMark_Start_address indicates the leading end 
address of ClipMarkO with the relative number of bytes from the 
leading end byte of the zzzzz.clpi file as a unit. The relative 
number of bytes is counted from zero. The_MakersPrivateData 
Start_address indicates the leading end address of 
MakersPrivateData O with the relative number of bytes from the 
leading end byte of the zzzzz.clpi file as a unit. The relative 
number of bytes is counted from zero. The padding_word is 
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inserted in accordance with the syntax of the zzzzz.clpi file. 
Nl, N2, N3, N4 and N5 must be zero or optional positive integers. 
The respective padding words may also assume optional values . 
[0298] The Cliplnfo is now explained. Fig. 46 shows the 
syntax of Cliplnfo. In the Cliplnfo () is stored the attribute 
information of corresponding AV stream files (Clip AV stream or 
Bridge-Clip AV stream file) . 

[0299] Turning to the syntax of the Cliplnfo shown in Fig. 46, 

version_number is the four character letters indicating the 
Li version number of this Cliplnfo (). The version_number must be 

encoded to "0045" in accordance with the ISO 646. Length is a 
% 32-bit unsigned integer indicating the number of bytes of 

Cliplnfo () from directly at back of the length field to the 
yj trailing end of the Cliplnf o ( ) . An 8-bit field of 

§=4= Clip_stream_type indicates the type of the AV stream 

til 

f(l corresponding to the Clip Information file, as shown in Fig. 47. 

y. 

The stream types of the respective AV streams will be explained 
*** subsequently. 

[0300] The 32-bit field of offset_SPN gives an offset value of 
the source packet number of the first source packet of the AV 
stream (Clip AV stream or the Bridge-Clip AV stream) . When the 
AV stream file is first recorded on the disc, this offset_SPN 
must be zero. 

[0301] Referring to Fig. 48, when the beginning portion of the 
AV stream file is erased by editing, offset_SPN may assume a 
value other than 0. In the present embodiment, the relative 
source packet number (relative address) referencing the 
offset_SPN is frequently described in the form of RSPNxxx, where 
xxx is modified such that RSPN xxx is RSPN EP start. The 
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relative source packet number is sized with the source packet 
number as a unit and is counted from the first source packet 
number of the AV stream file with the value of the offset_SPN as 
the initial value. 

[0302] The number of source packets from the first source 
packet of the AV stream file to the source packet referenced by 
the relative source packet number (SPN_xxx) is calculated by the 
following equation: 

SPN_xxx = RSPN_xxx - offset_SPN. 
[0303] Fig. 48 shows an instance in which offset_SPN is 4. 
[0304] TS_recording_rate is a 2 4-bit unsigned integer, which 
affords an input/output bit rate required for the AV stream to 
the DVR drive (write unit 22) or from the DVR drive (readout unit 
28). The record_time_and_date is a 56-bit field for storing the 
date and time of recording of the AV stream corresponding to the 
Clip and is an encoded representation of 

year/month/day/hour/minute in 4-bit binary coded decimal (BCD) 
for 14 numerical figures. For example, 2001/2/23:01:02:03 is 
encoded to "0x20011223010203". 

[0305] The duration is a 24-bit field indicating the total 
playback time of the Clip by hour/minute/second based on arrival 
time clocks. This field is six numerical figures encoded in 4- 
bit binary coded decimal (BCD). For example, 01:45:30 is encoded 
to "0x014530",. 

[0306] A flag t ime_controlled_f lag indicates the recording 
mode of an AV stream file. If this time_controlled_f lag is 1, it 
indicates that the recording mode is such a mode in which the 
file size is proportionate to the time elapsed since recording, 
such that the condition is shown by the following equation: 
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Ts_average_rate*192/188* ( t-start_time ) -a<= size_clip (t) 
<= TS_average_rate*192/188* (t - start_time) + a 
where TS_average_rate is an average bit rate of the transport 
stream of the AV stream file expressed by bytes/second. 

[0307] In the above equation, t denotes the time in seconds, 
while start_time is the time point when the first source packet 
of the AV stream file was recorded. The size_clip(t} is 10*192 
bytes and a is a constant dependent on TS_average_rate . 

[0308] If time_controlled_f lag is set to 0, it indicates that 
the recording mode is not controlling so that the time lapse of 
recording is proportionate to the file size of the AV stream. 
For example, the input transport stream is recorded in a 
transparent fashion. 

[0309] If time_controlled_flag is set to 1, the 24-bit field 
of TS_average_rate indicates the value of TS_average_rate used in 
the above equation. If time_cont rolled_f lag is set to 0, this 
field has no meaning and must be set to 0. For example, the 
variable bit rate transport stream is encoded by the following 
sequence: first, the transport rate is set to the value of 
TS_recording_rate . The video stream is encoded with a variable 
bit rate. The transport packet is intermittently encoded by not 
employing null packets . 

[0310] The 32-bit field of RSPN_arrival_time_discontinuity is 
a relative . address of a site where arrival timebase 
discontinuities are produced on the Bridge-Clip AV stream file. 
The RSPN_arrival_time_discontinuity is sized with the source 
packet number as a unit and is counted with the value of 
offset_SPN defined in the ClipInfoO as from the first source 
packet of the Bridge-Clip AV stream file. An absolute address in 
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the Bridge-Clip AV stream file is calculated based on the 
aforementioned equation: 

SPN_xxx = RSPN_xxx - offset_SPN. 
[0311] The 144-bit field of reserved_f or_sys tem_use is 

reserved for a system. If is_f ormat_identif ier_valid flag is 1, 
it indicates that the field of f ormat_identif ier is effective. 
If is_f ormat_identif ier_valid flag is 1, it indicates that the 
format_identif ier field is valid. If 

is_original_network_ID_valid flag is 1, it indicates that the 
field of original_network_ID is valid. If the flag 

is_transport_stream_ID_valid is 1, it indicates that the 
transport_stream_ID field is valid. If is_service_ID_valid flag 
is 1, it indicates that the service_ID field is valid. 
[0312] If is_country_code_valid flag is 1, it indicates that 
the field country_code is valid. The 32-bit field of 

f ormat_identif ier indicates the value of f ormat_identif ier owned 
by a registration descriptor (defined in ISO/IEC13818-1 ) in the 
transport stream. The 16-bit field of original_network_ID 
indicates the value of the original_network_ID defined in the 
transport stream. 

[0313] The 16-bit field in service_ID denotes the value of 
service_ID defined in the transport stream. The 24-bit field of 
country_code shows a country code defined by IS03166. Each 
character code is encoded by IS08859-1. For example, Japan is 
represented as "JPN" and is encoded to "0x4A 0x50 0x4E". The 
stream_f ormat_name is 15 character codes of ISO-646 showing the 
name of a format organization affording stream definitions of 
transport streams. An invalid byte in this field has a value of 
' OxFF' . 
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[0314] Format_identif ier, original_network_ID, 
transport_stream_ID, service_ID, count ry_code and 

stream_f ormat_name indicate service providers of transport 
streams. This allows to recognize encoding limitations on audio 
or video streams and stream definitions of private data streams 
other than audio/video streams or SI (service information). This 
information can be used to check if the decoder is able to decode 
the stream. If such decoding is possible, the information may be 
used to initialize the decoder system before starting the 
decoding . 

[0315] STC_Info is now explained. The time domain in the 
MPEG-2 transport stream not containing STC discontinuous 
points (discontinuous points of the system time base) is termed 
the STC_sequence . In the Clip, STC_sequence is specified by the 
value of STC_sequence_id. Figs. 50A and 50B illustrate a 
continuous STC domain. The same STC values never appear in the 
same STC_sequence, although the maximum time length of Clip is 
limited, as explained subsequently. Therefore, the same PTS 
values also never appear in the same STC_sequence . If the AV 
stream contains N STC discontinuous points, where N > 0, the Clip 
system time base is split into (N+l) STC_sequences . 
[0316] STC_Info stores the address of the site where STC 
discontinuities (system timebase discontinuities) are produced. 
As explained, with reference to Fig. 51, the RSPN_STC_start 
indicates the address and begins at a time point of arrival of 
the source packet referenced by the (k+l)st RSPN_STC_start and 
ends at a time point of arrival of the last source packet. 
[0317] Turning to the syntax of STC_Info shown in Fig. 52, 
version_number is four character letters indicating the version 
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number of STC_Info() . The version_n umber must be encoded to 
"0045" in accordance with ISO 646. 

[0318] Length is a 32-bit unsigned integer indicating the 
number of bytes of STC_Info() from directly after this length 
field to the end of STC_Inf o ( ) . If CPI_type of CPI ( ) indicates 
TU_map type, 0 may be set in this length field. If CPI_type of 
CPI() indicates EP_map type, the num_of_STC_sequences must be of 
a value not less than 1. 

[0319] An 8-bit unsigned integer of num_of_STC_sequences 
indicates the number of sequences in the Clip. This value 
indicates the number of the for-loops next following the field. 
The STC_sequence_id corresponding to the preset STC_sequence is 
defined by the order in which appears the RSPN_STC_start 
corresponding to the STC_sequence in the for-loop containing the 
RSPN_STC_start . The STC_sequence_id commences at 0. 
[0320] The 32-bit field of RSPN_STC_start indicates an address 
at which the STC_sequence commences on the AV stream file. 
RSPN_STC_start denotes an address where system time base 
discontinuities are produced in the AV stream file. The 
RSPN_STC_start may also be a relative address of the source 
packet having the first PCR of the new system time base in the AV 
stream. The RSPN_STC_start is of a size based on the source 
packet number and is counted from the first source packet of the 
AV stream file with the offset_SPN value defined in ClipInfoO as 
an initial value. In this AV stream file, the absolute address 
is calculated by the above-mentioned equation, that is 

SPN_xxx = RSPN_xxx - offset_SPN. 
[0321] Programlnfo in the syntax of zzzz.clip shown in Fig. 45 
is now explained with reference to Fig. 53. The time domain 
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having the following features in the Clip is termed 
program_sequence . These feature are that the value of PCR_PID is 
not changed, the number of video elementary streams is also not 
changed, the PID values in the respective video streams are not 
changed, the encoding information which is defined by 
VideoCodinglnf o thereof is not changed, the number of audio 
elementary streams is also not changed, the PID values of the 
respective audio streams are not changed, and that the encoding 
information, which is defined by AudioCodinglnf o thereof, is not 
changed. 

[0322] Program_sequence has only one system time base at the 
same time point. Program_sequence has a sole PMT at the same time 
point. Programlnf o ( ) stores the address of the site where the 
program_sequence commences. RSPN_program_sequence_start 
indicates the address. 

[0323] Fig. 54 illustrates the syntax of Programlnf o. Turning 
to the Programlnf o shown in Fig. 54, version_n umber is four 
character letters indicating the version number of Programlnfo ( ) . 
The version_number must be encoded to "0045" in accordance with 
ISO 646. 

[0324] Length is a 32-bit unsigned integer indicating the 
number of bytes of Programlnf o ( ) from directly at back of this 
length field to the end of Programlnf o () . If CPI_type of CPI ( ) 
indicates the_ TU_map type, this length field may be set to 0 . If 
the CPI_type of CPI ( ) indicates EP_map type, the 
number_of_programs must be of a value not less than 1. 
[0325] An 8-bit unsigned integer of 

number_of_program_sequences denotes the number of 

program_sequences in the Clip. This value indicates the number 
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of for-loops next following this field. If program_sequence in 
the Clip is not changed, 1 must be set in the number of 
program_sequences . A 32-bit field of RSPN_program_sequence_start 
is a relative address where the program sequence commences on the 
AV stream. 

[0326] RSPN_program_sequence_start is sized with the source 
packet number as a unit and is counted with the value of 
offset_SPN defined in the ClipInfoO as from the first source 
packet of the AV stream file. An absolute address in the AV 
1^ stream file is calculated by: 

SPN_xxx = RSPN_xxx - offset_SPN. 
[0327] The values of RSPN_program_sequence_start in the for- 
loop syntax must appear in the rising order. 
|ij [0328] A 16-bit field of PCR_PID denotes the PID of the 

Lii transport packet containing an effective PCR field effective for 

the program_sequence . An 8-bit field of number_of_audios 

r Z indicates the number of for-loops containing audio stream_PID and 

M 

*** AudioCodinglnf o ( ) . A 16-bit field of video_st ream_PID indicates 

the PID of the transport packet containing a video stream 
effective for its program_sequence . VideoCodinglnf o ( ) , next 
following this field, must explain the contents of the video 
stream referenced by its video_s tream_PID . 

[0329] A 16-bit field of audio_stream_PID indicates the PID of 
a transport packet containing the audio stream effective for its 
program_sequence . The AudioCodinglnf o () , next following this 
field, must explain the contents of the video stream referenced 
by its audio_s tream_PID . 

[0330] The order in which the values of video_s tream_PID in 

the for-loop of the syntax must be equal to the sequence of PID 
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encoding of the video stream in the PMT effective for the 
program_sequence . Additionally, the order in which the values of 
audio_stream_PID appears in the for-loop of the syntax must be 
equal to the sequence of PID encoding of the audio stream in the 
PMT effective for the program_sequence . 

[0331] Fig. 55 shows the syntax of VideoCodinglnf o in the 

syntax of the Programlnfo shown in Fig. 54. Turning to the syntax 
of the VideoCodinglnf o shown in Fig. 55, an 8-bit field of 
video_format indicates the video format corresponding to 
video_stream_PID in Programlnfo () , as shown in Fig. 56. 
[0332] Referring to Fig. 57, an 8-bit field of frame_rate 
indicates the video frame rate corresponding to the 
video_stream_PID in Programlnfo () . An 8-bit field of 

display_aspect_ratio indicates a video display aspect ratio 
corresponding to video_s tream_PID in Programlnfo () . 
[0333] Fig. 59 shows the syntax of AudioCodinglnf o in the 
syntax of Programlnfo shown in Fig. 54. Turning to the syntax of 
the AudioCoding Info shown in Fig. 59, an 8-bit field of 
audio_format indicates the audio encoding method corresponding to 
audio_stream_PID in Programlnfo () , as shown in Fig. 60. 
[0334] An 8-bit field of audio_component_type indicates an 
audio component type corresponding to audio_stream_PID in 
Programlnf o ( ) , as shown in Fig. 61, whilst an 8-bit field of 
sampling_f requency indicates an audio sampling frequency 
corresponding to audio_stream_PID in Programlnf o {) , as shown in 
Fig. 62. 

[0335] The CPI (Characteristics Point Information) in the 
syntax of zzzzz.clip shown in Fig. 45 is explained. The CPI is 
used for correlating the time information in the AV stream with 
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the address in its file. The CPI is of two types, namely EPjnap 
and TU_map. In Fig. 63, if CPI_type in CPI() is EP_map, its 
CPI ( ) contains EP_map. In Fig. 64, if CPI_type in CPI ( ) is 
TU_map, its CPI() contains TU_map . One AV stream has one EP_map 
or one TU_map . If the AV stream is an SESF transport stream, the 
corresponding Clip must own an EP_map. 

[0336] Turning to the syntax of CPI shown in Fig. 65, the 
version_number is four character letters indicating the version 
number of this CPI(). The version_n umber must be encoded to 
"0045" in accordance with ISO 646. Length is a 32-bit unsigned 
integer indicating the number of bytes as from directly after 
this length field to the trailing end of the CPI ( ) . The CPI_type 
is a 1-bit flag and indicates the CPI type of Clip, as shown in 
Fig. 66. 

[0337] The EP_map in the CPI syntax shown in Fig. 65 is 

explained. There are two types of the EP_map, that is EP_map for 
a video stream and an EP_map for an audio stream. The 
EP_map_type in the EP_map differentiates between these EP_map 
types. If the Clip contains one or more video streams, the 
EP_map for the video stream must be used. If the Clip does not 
contain a video stream but contains one or more audio streams, 
the EP_map for the audio stream must' be used. 

[0338] The EP_map for a video stream is explained with 
reference to_Fig. 67. The EP_map for the video stream has data 
stream_PID, PTS_EP_start and RSPN_EP_start . The stream_PID shows 
the PID of the transport packet transmitting a video stream. The 
PTS_EP_start indicates the PTS of an access unit beginning from 
the sequence header of the video stream. The RSPN_RP_start 
indicates the address of a source packet including the first byte 
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of the access unit referenced by the PTS_EP_start in the AV 
stream. 

[0339] A sub table, termed EP_map_f or_one_s tream_PID ( ) is 
created from one video stream transmitted by the transport packet 
having the same PID to another. If plural video streams exist in 
the Clip, the EP_map may contain plural 

EP_map_f or_one_stream_PID ( ) . 

[0340] The EP_map for audio stream has data stream_PID, 
PTS_EP_start and RSPN_EP_start . The stream_PID shows a PID of a 
transport packet transmitting an audio stream. The PTS_EP_start 
shows the PTS of an accessing unit in the audio stream. The 
RSPN_EP-start indicates an address of a source packet containing 
a first byte of the access unit referenced by PTS_EP_start of the 
AV stream. 

[0341] The sub table termed EP_map_f or_one_stream_PID ( ) is 

created from one audio stream transmitted by the transport packet 
having the same PID to another. If there exist plural audio 
streams in the Clip, EP_map may contain plural 
EP_map_f or_one_stream_PID ( ) . 

[0342] Turning to the relation between EP_map and STC_Info, 
one EP_map_f or_one_stream_PID ( ) is created in one table 
irrespective of discontinuous points in the STC. Comparison of 
the value of the RSPN_EP_start to the value of RSPN_STC_start 
defined in STC_Info() reveals the boundary of data of EP_map 
belonging to respective STC_seguences (see Fig. 68) . The EP_map 
must have one EP_map_f or_one_stream_PID ( ) for a continuous stream 
range transmitted by the same PID. In the case shown in Fig. 
69, programtl and program#3 have the same video PID, however, the 
data range is not continuous, so that EP_map_f or_one_stream_PID ( ) 
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must be provided for each program. 

[0343] Fig. 70 shows the EPjmap syntax. By way of explanation 

of the EP_map syntax shown in Fig. 70, the EP_type is a 4-bit 
field and shows the EP_map entry point type, as shown in Fig. 71. 
The EP_type shows the semantics of the data field next following 
this field. If Clip includes one or more video stream, the 
EP_type must be set to 0 ('video') . Alternatively, if the Clip 
contains no video stream but contains one or more audio stream, 
then EP_type must be set to 1 ('audio'). 

[0344] The 16-bit field of number_of_stream_PIDs indicates the 
number of times of loops of the for-loop having 
number_of_stream_PIDs in the EP_map ( ) as a variable. The 16-bit 
field of stream_PID (k) indicates the PID of the transport packet 
transmitting the number k elementary stream (video or audio 
stream) referenced by EP_map_f or_one_s tream_PID 

(num_EP_entries ( k) ) . If EP_type is 0 ('video'), its elementary 
stream must be a video stream. If EP_type is equal to 1 
('audio'), its elementary stream must be the audio stream. 
[0345] The 16-bit field of num_EP_entries ( k) indicates the 
num_EP_entries (k) referenced by EP_map_entries (k) . The 
EP_map_for_one_stream_PID_Start_address (k) : this 32-bit field 
indicates the relative address position at which the 
EP_map_for_one_stream_PID (num_EP_entries (k) ) begins in the 
EP_map() . This value is indicated by the size as from the first 
byte of the EP_map(). 

[0346] Padding_word must be inserted in accordance with the 
EP_map() syntax. X and Y must be optional positive integers. 
The respective padding words may assume any optional values. 
[0347] Fig. 72 shows the syntax of EP_map_f or_one_stream_PID . 
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By way of explanation of the syntax of the 
EP_map_f or_one_stream_PID shown in Fig. 72, the semantics of the 
32-bit field of PTS_EP_start differs with the EP_type defined by 
EP_map(). If EP_type is equal to 0 ('video'), this field has 
upper 32 bits of the 33-bit precision PTS of the access unit 
beginning with a sequence header of the video stream. If the 
EP_type is equal to 1 ('audio'), this field has upper 32 bits of 
PTS of 33 bit precision of the access unit of the audio stream. 
[0348] The semantics of the 32-bit field of RSPN_EP_start 
differs with the EP_type defined in EP_map(). If EP_type is 
equal to 0 ('video'), this field indicates the relative address 
of the source packet including the first byte of the sequence 
header of the access unit referenced by the PTS_EP_start in the 
AV stream. Alternatively, if EP_type is equal to 1 ('audio'), 
this field indicates the relative address of the source packet 
containing the first byte in the audio stream of the access unit 
referenced by the PTS_EP_start in the AV stream. 

[0349] RSPN_EP_start is of a size which is based on the source 

packet number as a unit, and is counted from the first source 
packet of the AV stream file, with the value of the offset_SPN, 
defined in ClipInfoO, as an initial value. The absolute address 
in the AV stream file is calculated by 

SPN_xxx = RSPN_xxx - offset_SPN. 
[0350] It .is noted that the value of the RSPN_EP_start in the 
syntax must appear in the rising order. 

[0351] The TU_map is now explained with reference to Fig. 73. 

TU_map forms a time axis based on the source packet arrival time 
clock (timepiece of the arrive time base) . This time axis is 
termed TU_map_time_axis . The point of origin of TU_map_time_axis 

67 



is indicated by offset_time in the TU_map ( ) . TU_map_time_axis is 
divided in a preset unit as from of f set_time , this unit being 
termed time_unit. 

[0352] In each time_unit in the AV stream, addresses on the AV 
stream file of the source packet in the first complete form are 
stored in TU_map . These addresses are termed 

RSPN_time_unit_start . The time at which begins the k(k > 0)th 
time_unit on the TU_map_time_axis is termed TU_start_time ( k) . 
This value is calculated based on the following equation: 

TU_start_time (k) = offset_time + k*time__unit_size . 
[0353] It is noted that TU_start_time ( k) has a precision of 45 
kHz. 

[0354] Fig. 74 shows the syntax of TU_map . By way of 
explanation of the TU_map syntax shown in Fig. 74, the 32-bit 
field of offset_time gives an offset time relative to 
TU_map_time_axis . This value indicates the offset time relative 
to the first time_unit in the Clip. The offset_time is of a size 
based on 45 kHz clock derived from the 27 MHz precision arrival 
time clocks as a unit. If the AV stream is to be recorded as new 
Clip, offset_time must be set to 0. 

[0355] The 32-bit field of time_unit_size affords the size of 
the time_unit, and is based on 45 kHz clocks, derived from the 27 
MHz precision arrival time clocks, as a unit. Preferably, 
time_unit_siz-e is not longer than one second ( time_unit_size < 
45000) . The 32 bit field of number_of_time_unit_entries 
indicates the number of entries stored in TU_map ( ) . 
[0356] The 32-bit field of RSN_time_unit_start indicates the 
relative address of a site in the AV stream at which begins each 
time_unit. RSN_time_unit_start is of a size based on the source 
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packet number as unit and is counted with the value of offset_SPN 
defined in ClipInfoO as from the first source packet of the AV 
stream file as an initial value. The absolute address in the AV 
stream file is calculated by 

SPN_xxx = RSPN_xxx - offset_SPN. 
[0357] It is noted that the value of RSN_time_unit_start in 

the for-loop of the syntax must appear in the rising order. If 
there is no source packet in the number (k+1) time_unit, the 
number (k+1) RSN_time_unit_start must be equal to the number k 
RSPN_time_unit_start . 

[0358] By way of explanation of the ClipMark in the syntax of 
zzzzz.clip shown in Fig. 45, the ClipMark is the mark information 
pertinent to Clip and is stored in the ClipMark. This mark is 
not set by a user, but is set by a recorder (recording and/or 
reproducing apparatus 1) . 

[0359] Fig. 75 shows the ClipMark syntax. By way of 

explanation of the ClipMark syntax shown in Fig. 75, the 
version_number is four character letters indicating the version 
number of this ClipMark. The version_number must be encoded in 
accordance with ISO 646 to "0045". 

[0360] Length is a 32-bit unsigned integer indicating the 
number of bytes of the ClipMark () as from directly after the 
length field to the trailing end of ClipMark (). The 
number_of_Clip_marks is a 16-bit unsigned integer indicating the 
number of marks stored in ClipMark and may be equal to 0. 
Mark_type is an 8-bit field indicating the mark type and is 
encoded in accordance with the table shown in Fig. 76. 
[0361] Mark_time_stamp is a 32-bit field and stores the time 
stamp indicating a pointer having a specified mark. The 
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semantics of mar k_time_ stamp differs with CPI_type in the 
PlayListO, as shown in Fig. 77. 

[0362] If CPI_type in CPI() indicates the EP_map type, this 8- 
bit field indicates the STC_sequence_id of the continuous STC 
domain where there is placed mark_time_s tamp . If CPI_type in 
CPIO indicates TU_map type, this 8-bit field has no meaning but 
is set to 0. The 8-bit field of character_set indicates the 
indicating method of character letters encoded in the mark_name 
field. The encoding method corresponds to the value shown in 
Fig. 19. 



[0363] The 8-bit field of name_length indicates the byte 
length of the mark name shown in the mark_name field. This 
mark_name field indicates the mark name. The byte number 
corresponding to the number of the name_length from the left of 
this field is the effective character number and denotes the mark 
name. In the mark_name field, the values next following these 
effective character letters may be arbitrary. 

[0364] The field of ref_thumbnail_index indicates the 
information of the thumbnail picture appended to the mark. If 
the ref_thumbnail_index field is of a value different from 
OxFFFF, a thumbnail picture is added to its mark, with the 
thumbnail picture being stored in the mark.thmb file. This 
picture is referenced using the value of ref_thumbnail_index in 
the mark.thmb file. If the ref _thumbnail_index field is of a 
value equal to OxFFFF, a thumbnail picture is not appended to its 
mark . 

[0365] MakerPrivateData has already been explained with 
reference to Fig. 22 and hence is not explained here 
specifically . 

[0366] Next, thumbnail_information is explained. A thumbnail 
picture is stored in a menu.thmb file or in a mark.thmb file. 
These files are of the same syntax structure and own a sole 
Thumbnail (). The menu.thmb file stores a picture representing 
respective PlayLists. The totality of menu thumbnails are stored 
in the sole menu.thmb file. 

[0367] The mark.thmb file stores a mark thumbnail picture, 
that is, a picture representing a mark point. The totality of 
mark thumbnails corresponding to the totality of PlayLists and 
Clips are stored in the sole mark.thmb file. Since the 



thumbnails are frequently added or deleted, the operation of 
addition and partial deletion must be executable readily and 
speedily. For this reason, Thumbnail () has a block structure. 
Picture data is divided into plural portions each of which is 
stored in one tn_block. One picture data is stored in 
consecutive tn_blocks . In the string of tn_blocks, there may 
exist a tn_block not in use. The byte length of a sole thumbnail 
picture is variable. 

[0368] Fig. 78 shows the syntax of menu.thmb and mark.thmb and 
Fig. 7 9 the syntax of Thumbnail in the syntax of menu.thmb and 
mark.thmb shown in Fig. 78. By way of explanation of the syntax 
of Thumbnail, shown in Fig. 7 9, version_number is four character 
letters denoting the version number of this Thumbnail (). 
Version_number must be encoded to "0045" in accordance with ISO 
646 . 

[0369] Length is a 32-bit unsigned integer indicating the 
number of bytes of MakerPrivateData ( ) as from directly at back of 
the length field up to the trailing end of Thumbnail (). 
Tu_block_start_address is a 32-bit unsigned integer indicating 
the leading end byte address of the first tn_block, in terms of 
the relative number of bytes as from the leading end byte of 
Thumbnail () as a unit. The number of relative bytes is counted 
from 0. Number_of_thumbnails is a 16-bit unsigned integer which 
gives the number of entries of a thumbnail picture contained in 
Thumbnail ( ) . 

[0370] Tu_block_size is a 16-bit unsigned integer which gives 
the size of one tn_block, in terms of 1024 bytes as a unit. If, 
for example, tn_block_size =1, it indicates that the size of one 
tn_block is 1024 bytes. Number_of_tn_blocks is a 116-bit 
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unsigned integer indicating the number of entries of tn_block in 
this Thumbnail () . Thumbnail_index is a 16-bit unsigned integer 
indicating the index number of the thumbnail picture represented 
by the thumbnail information for one for-loop beginning from the 
thumbnail_index field. The value OxFFFF must not be used as 
Thumbnail_index. This Thumbnail_index is referenced by 

ref_thumbnail_index in UIAppInf oVolume ( ) , UIAppInf oPlayList ( ) , 
PlayListMark ( ) and ClipMark ( ) . 

[0371] Thumbnail_picture_f ormat is an 8-bit unsigned integer 
representing the picture format of a thumbnail picture and 
assumes a value shown in Fig. 80. In the table, DCF and PNG are 
allowed only in menu.thmb. The mark thumbnail must assume the 
value of "0x00" (MPEG-2 Video 1-picture) . 
O [0372] Picture_data_size is a 32-bit unsigned integer 

w 

* indicating the byte length of a thumbnail picture in terms of 

§=1= 

m bytes as a unit. Start_tn_block_number is a 16-bit unsigned 

JT integer indicating the tn_block number of the tn_block where data 

of the thumbnail picture begins. The leading end of the 
thumbnail picture data must coincide with the leading end of the 
tn_block. The tn_block number begins from 0 and is relevant to 
the value of a variable k in the for-loop of tn_block. 
[0373] X_picture_length is a 16-bit unsigned integer 
indicating the number of pixels in the horizontal direction of a 
frame of a thumbnail picture. Y_picture_length is a 16-bit 
unsigned integer indicating the number of pixels in the vertical 
direction of a frame of a thumbnail picture. Tn_block is an area 
in which to store a thumbnail picture. All tn_block in the 
Thumbnail () are of the same size (fixed length) and are of a size 
defined by tn_block_sixe . 
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[0374] Figs. 81A and 8 IB schematically show how thumbnail 
picture data are stored in tn_block. If, as shown in Figs. 81A 
and 81B, the thumbnail picture begins at the leading end of 
tn_block, and is of a size exceeding 1 tn_block, it is stored 
using the next following tn_block. By so doing, data with a 
variable length can be managed as fixed length data, so that the 
editing of deletion can be coped with by simpler processing. 
[0375] An AV stream file is now explained. The AV stream file 

is stored in the "M2TS" directory (Fig. 14) . There are two types 
of AV stream files, namely, a Clip A stream file and a Bridge- 
Clip AV stream file. Both AV streams must be of the structure of 
DVR MPEG-2 transport stream file as hereinafter defined. 
[0376] First, the DVR MPEG2 transport stream is explained. 

The structure of the DVR MPEG-2 transport stream is shown in Fig. 
82. The AV stream file has the structure of a DVR MPEG 2 
transport stream. The DVR MPEG 2 transport stream is made up of 
an integer number of Aligned units. The size of the Aligned unit 
is 6144 bytes (2048*3 bytes) . The Aligned unit begins from the 
first byte of the source packet. The source packet is 192 bytes 
long. One source packet is comprised of TP_extra_header and a 
transport packet. TP_extra_header is 4 bytes long, with the 
transport packet being 188 bytes long. 

[0377] One Aligned unit is made up of 32 source packets. The 
last Aligned unit in the DVR MPEG 2 transport stream is also made 
up of 32 source packets. Therefore, the DVR MPEG 2 transport 
stream ends at a boundary of the Aligned unit. If the number of 
the transport packets of the input transport stream recorded on a 
disc is not a multiple of 32, a source packet having a null 
packet (transport packet of PID = OxlFFFF) must be used as the 
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last Aligned unit. The file system must not use excess 
information in the DVR MPEG 2 transport stream. 

[0378] Fig. 83 shows a recorder model of the DVR MPEG 2 
transport stream. The recorder shown in Fig. 83 is a conceptual 
model for prescribing the recording process. The DVR MPEG 2 
transport stream obeys this model. 

[0379] The input timing of the MPEG 2 transport stream is now 
explained. The input MPEG 2 transport stream is a full transport 
stream or a partial transport stream. The input MPEG 2 transport 
stream must obey the ISO/IEC138 18-1 or ISO/IEC 13818-9. The 
number i byte of the MPEG 2 transport stream is input 
simultaneously at time t(i) to T-STD (transport stream system 
target decoder provided for in SO/IEC138 1 8 -1 ) and to the source 
packetizer. Rpk is an instantaneous maximum value of the input 
rate of the transport packet. 

[0380] A 27 MHz PLL 52 generates a frequency of 27 MHz clock. 
The 27 MHz clock frequency is locked at a value of the program 
clock reference (PCR) of the MPEG 2 transport stream. An arrival 
time clock counter 53 counts the pulses of the 27 MHz frequency. 
Arrival_time_clock (i) is a count value of the arrival time clock 
counter at time t(i). 

[0381] A source packetizer 54 appends TP_extra_ header to the 
totality of the transport packets to create a source packet. 
Arrival_time_stamp indicates the time when the first byte of the 
transport packet reaches both the T-STD and the source 
packetizer. Arrival_time_s tamp ( k) is a sampled value of the 
Arrival_time_clock (k) as represented by the following equation: 
arrival_time_stamp (k) = arrival_time_clock ( k) % 230 
where k denotes the first byte of the transport packet. 



[0382] If the time separation between two neighboring 
transport packets is 230/2 7000000 sec (about 40 sec) or longer, 
the difference of the arrival_time_stamp of the two transport 
packets should be set to 230/2 7000000 sec. The recorder is 
provided for such case. 

[0383] A smoothing buffer 55 smoothes the bit rate of the 
input transport stream. The smoothing buffer must not overflow. 
Rmax is the output bit rate of the source packet from the 
smoothing buffer when the smoothing buffer is not null. If the 
smoothing buffer is null, the output bit rate of the smoothing 
buffer is 0. 

''' [0384] Next, the parameters of the recorder model of the DVR 

=|1 MPEG 2 transport stream are explained. The value of Rmax is 

[, given by TS_recording_rate as defined in ClipInfoO associated 

with the AV stream file. This value may be calculated from the 

following equation: 

N 5 Rmax = TS_recording_rate*192/188 

Q 

p* where the value of TS_recording_rate is of a size in 

bytes/second . 

[0385] If the input transport stream is an SESF transport 

stream, Rpk must be equal to TS_recording_rate as defined in 
ClipInfoO associated with the AV stream file. If the input 
transport stream is not an SESF transport stream, reference may 
be made to values defined, e.g., in a descriptor of the MPEG 2 
transport stream, such as maximum_bit rate_descriptor or 
partial_stream_descriptor for this value. 

[0386] If the input transport stream is an SESF transport 
stream, the smoothing buffer size is 0 . If the input transport 
stream is not an SESF transport stream, reference may be made to 
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values defined in the descriptor of the MPEG 2 transport stream, 
such as, for example, the values defined in the 
smoothing_buf f er_descriptor, short_smoothing _buf f er_descriptor 
or in the partial_transport_stream_descriptor . 

[0387] For the recorder and the player (reproducing 

apparatus) , a sufficient size buffer needs to be provided. The 
default buffer size is 1536 bytes. 

[0388] Next, a player model of the DVR MPEG 2 transport stream 
is explained. Fig. 84 shows a player model of the DVR MPEG 2 
transport stream. This is a conceptual model for prescribing the 
reproduction process. The DVR MPEG 2 transport stream obeys this 
model . 

[0389] A 27 MHz crystal 61 generates the frequency of 27 MHz. 

An error range of the 27MHz frequency must be ±30 ppm (2 7000000 
± 810 Hz) . The arrival time clock counter 62 is a binary counter 
for counting the pulses of the frequency of 27 MHz. 
Arrival_time_clock (i) is a count value of the arrival time clock 
counter at time t(i). 

[0390] In the smoothing buffer 64, Rmax is the input bit rate 
of the source packet to the smoothing buffer when the smoothing 
buffer is not full. If the smoothing buffer is full, the input 
bit rate to the smoothing buffer is 0. 

[0391] By way of explaining the output timing of the MPEG 2 
transport stream, if the arrival_time_stamp of the current source 
packet is equal to 30 bits on the LSB side of 
arrival_time_clock (i) , the transport packet of the source packet 
is removed from the smoothing buffer. Rpk is an instantaneous 
maximum value of the transport packet rate. The overflow of the 
smoothing buffer is not allowed. 



[0392] The parameters of the player model of the DVR MPEG 2 
transport stream are the same as those of the recorder model of 
the DVR MPEG 2 transport stream described above. 

[0393] Fig. 85 shows the syntax of the source packet. 

Transport_packet ( ) is an MPEG 2 transport stream provided in 
ISO/IEC 13818-1. The syntax of TP_extra_header in the syntax of 
the source packet shown in Fig. 85 is shown in Fig. 86. By way 
of explaining the syntax of the TP_extra_header , shown in Fig. 
86, copy_permission_indicator is an integer representing the 
copying limitation of the payload of the transport packet. The 
copying limitation may be copy free, no more copy, copy once or 
I1J copying prohibited. Fig. 87 shows the relation between the value 

yj of copy_permission_indicator and the mode it designates. 

Ilj [0394] Copy_permission_indicator is appended to the totality 

"L of transport packets. If the input transport stream is recorded 

- using the IEEE1394 digital interface, the value of 

copy_permission_indicator may be associated with the value of EMI 

o 

M= (encryption mode indicator) . If the input transport stream is 

recorded without employing the IEEE1394 digital interface, the 
value of copy_permission_indicator may be associated with the 
value of the CCI embedded in the transport packet. If an analog 
signal input is self-encoded, the value of 

copy_permission_indicator may be associated with the value of 
CGMS-A of the analog signal. 

[0395] Arrival_time_stamp is an integer having a value as 

specified by arrival_time_stamp in the following equation: 

arrival_time_stamp (k) = arrival_time_clock ( k) %230 . 
[0396] By way of defining the ClipAV stream, the ClipAV stream 
must have a structure of the DVR MPEG 2 transport stream defined 
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as described above. Arrival_time_clock ( i ) must increase 
continuously in the Clip AV stream. If there exists a 
discontinuous point of the system time base (STC base) in the 
Clip AV stream, arrival_time_clock ( i ) in the Clip AV stream must 
increase continuously. 

[0397] The maximum value of the difference of the 
arrival_time_clock (i) between the beginning and the end of the 
Clip AV stream must be 2 6 hours. This limitation guarantees 
that, if there is no discontinuous point in the system time base 

(STC base) in the MPEG 2 transport stream, the PTS (presentation 
time stamp) of the same value never appears in the Clip AV 
stream. The MPEG 2 system standard provides that the PTS has a 
wraparound period of 233/90000 sec (about 26.5 hours). 

[0398] By way of defining the Bridge-Clip AV stream, the 
Bridge-Clip AV stream must have a structure of the DVR MPEG 2 
transport stream defined as described above. The Bridge-Clip AV 
stream must include a discontinuous point of one arrival time 
base. The transport stream ahead and at back of the 
discontinuous point of the arrival time base must obey the 
encoding limitations and the DVR-STD as later explained. The 
present embodiment supports the video-audio seamless connection 
between Playltems being edited. Seamless connection between 
Playltems guarantees "continuous data supply" to the 
player/decoder and "seamless decoding processing". The 
"continuous data supply" is the capability of guaranteeing data 
supply to the decoder at a bit rate necessary to prevent buffer 
underflow. In order to enable data to be read out from the disc 
as data real-time properties are assured, data is to be stored in 
terms of a continuous block of a sufficiently large size as a 



"[0399] The "seamless decoding processing" means the capability 

of a player in displaying audio video data recorded on the disc 
without producing pause or gap in the playback output of the 
decoder . 

[0400] The AV stream, referenced by the seamless connected 
Playltems, is explained. Whether or not the seamless display of 
a previous Playltem and the current Playltem is guaranteed may be 
verified from the connection_condition field defined in the 
current Playltem. There are two methods for seamless connection 
of Playltems, that is, a method employing Bridge-Clip and a 
method not employing Bridge-Clip. 

[0401] Fig. 88 shows the relation between the previous 
Playltem and the current Playltem in case of employing Bridge- 
Clip. In Fig. 88, the stream data, read out by the player, is 
shown shaded. In Fig. 88, TS1 is made up of shaded stream data 
of the Clipl (Clip AV stream) and shaded stream data previous to 
RSPN_arrival_time_discontinuity . 

[0402] The shaded stream data of Clipl of TS1 is stream data 
from an address of a stream required for decoding the 
presentation unit corresponding to IN_item of the previous 
Playltem (shown as IN-timel in Fig. 88) up to the source packet 
referenced by RSPN_exit_f rom_previous_Clip . The shaded stream 
data prior to RSPN_arrival_time_discontinuity of Bridge-Clip 
contained in TS1 is stream data as from the first source packet 
of Bridge-Clip up to the source packet directly previous to the 
source packet referenced by RSPN_arrival_time_discontinuity . 
[0403] In Fig. 88, TS2 is made up of shaded stream data of 
Clip 2 (Clip AV stream) and shaded stream data subsequent to 
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RSPN_arrival_time_discontinuity of Bridge-Clip. The shaded stream 
data as from the RSPN_arrival_time_discontinuity of Bridge-Clip 
contained in TS2 is stream data from the source packet referenced 
by RSPN_arrival_time_discontinuity to the last source packet of 
Bridge-Clip. The shaded stream data of Clip2 of TS2 is stream 
data from the source packet referenced by 

RSPN_enter_to_current_Clip to the address of the stream required 
for decoding the presentation unit corresponding to OUT_time of 
current Playltem (shown by 0UT_time2 in Fig. 88) . 
[0404] Fig. 8 9 shows , the relation between the previous 
Playltem and the current Playltem in case of not employing 
Bridge-Clip. In this case, the stream data read out by the 
player is shown shaded. In Fig. 89,TS1 is made up of shaded 
stream data of the Clipl (Clip AV stream) . The shaded stream 
data of Clipl of TS1 is data beginning at an address of a stream 
necessary in decoding a presentation unit corresponding to 
IN_time of the previous Playltem, shown at IN_timel in Fig. 89, 
as far as the last source packet of Clipl. 

[0405] In Fig. 8 9, TS2 is shaded stream data of Clip2 (Clip AV 
stream) . The shaded stream data of Clip2 of TS2 is stream data 
beginning at a first source packet of Clip2 as far as an address 
of the stream necessary for decoding the presentation unit 
corresponding to 0UT_time of current Playltem (shown at 0UT_time2 
in Fig. 89) . _ 

[0406] In Figs. 88 and 89, TS1 and T2 are continuous streams 
of the source packet. Next, the stream provisions of TS1 and TS2 
and the connection conditions therebetween are scrutinized. 
First, encoding limitations for seamless connection are 
scrutinized. By way of limitations on the encoding structure of 
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a transport stream, the number of programs contained in TS1 and 
TS2 must be 1. The number of video streams contained in TS1 and 
TS2 must be 1. The number of audio streams contained in TS and 
TS2 must be 2 or less. The numbers of the audio streams 
contained in TS1 and TS2 must be equal to each other. It is also 
possible for elementary streams or private streams other than 
those depicted above to be contained in TS1 and/or TS2 . 
[0407] The limitations on the video bit stream are now 
explained. Fig. 90 shows a typical seamless connection indicated 
by a picture display sequence. In order for a video stream to 
be demonstrated seamlessly in the vicinity of a junction point, 
unneeded pictures displayed at back of OUT_timel (OUT_time of 
Clipl) and ahead of IN_time2 (IN_time of Clip2) must be removed 
by a process of re-encoding the partial stream of the Clip in the 
vicinity of the junction point. 

[0408] Fig. 91 shows an embodiment of realizing seamless 

connection using BridgeSequence . The video stream of Bridge-Clip 
previous to RSPN_arrival_time_discontinuity is comprised of an 
encoded video stream up to a picture corresponding to 0UT_timel 
of Clipl of Fig. 90. This video stream is connected to the video 
stream of previous Clipl and is re-encoded to form an elementary 
stream conforming to the MPEG2 standard. 

[0409] The video stream of Bridge-Clip subsequent to 

RSPN_arrival_time_discontinuity is made up of an encoded video 
stream subsequent to a picture corresponding to IN_time2 of Clip2 
of Fig. 90. The decoding of this video stream can be started 
correctly for connecting the video stream to the next following 
Clip2 video stream. Re-encoding is made such that a sole 

continuous elementary stream conforming to MPEG 2 standard will 
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be formed. For creating Bridge-Clip, several pictures in general 
need to be re-encoded, whilst other pictures can be copied from 
the original Clip. 

[0410] Fig. 92 shows an embodiment of realizing seamless 
connection without employing BridgeSequence in the embodiment 
shown in Fig. 90. The Clipl video stream is comprised of an 
encoded video stream as far as the picture corresponding to 
OUT_timel of Fig. 90 and is re-encoded so as to give an 
elementary stream conforming to the MPEG2 standard. In similar 
manner, the video stream of Clip2 is made up of encoded bit 
streams subsequent to the picture associated with IN_time2 of 
ft! Clip2 of Fig. 90. These encoding bit streams are already re- 

*fj encoded to give a sole continuous elementary stream conforming to 

... 

2 the MPEG2 standard. 

j\ [0411] By way of explaining encoding limitations of the video 

stream, the frame rates of the video streams of TS1 and TS2 must 
be equal to each other. The video stream of TS1 must be 

P 

H terminated at sequence_end_code. The video stream of TS2 must 

commence at Sequence header, GOP Header and with an I-picture. 
The video stream of TS2 must commence at a closed GOP. 
[0412] The video presentation units defined in a bit stream 
(frame or field) must be continuous with a junction point in- 
between. No gap of the fields or frames are allowed to exist at 
junction points. In case of encoding employing 3-2 pulldown, it 
may be necessary to rewrite "top_f ield_f irst " and 
"repeat_f irst_f ield" flags. Alternatively, local re-encoding may 
be made to prevent field gaps from being produced. 
[0413] By way of explaining encoding limitations on the audio 
bit stream, the audio sampling frequency of TS1 and that of TS2 
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must be equal to each other. The audio encoding method of TS1 
and that of TS2 (for example, MPEG1 layer 2, AC-3, SESF LPCM and 
AAC) must be equal to each other. 

[0414] By way of explaining encoding limitations on MPEG-2 
transport stream, the last audio frame of the audio stream of TS1 
must contain audio samples having a display timing equal to the 
display end time of the last display picture of TS1. The first 
audio frame of the audio stream of TS2 must contain an audio 
sample having a display timing equal to the display start timing 
of the first display picture of TS2. 

[0415] At a junction point, no gap may be allowed to exist in 
a sequence of the audio presentation units. As shown in Fig. 93, 
there may be an overlap defined by the length of the audio 
presentation unit less than two audio frame domains. The first 
packet transmitting an elementary stream of TS2 must be a video 
packet. The transport stream at the junction point must obey the 
DVR-STD which will be explained subsequently. 

[0416] By way of explaining limitations on the Clip and 
Bridge-Clip, no discontinuities in the arrival time base are 
allowed to exist in TS1 or in TS2. 

[0417] The following limitations are applied only to the case 

of employing the Bridge-Clip. The Bridge-Clip AV stream has a 
sole discontinuous point in the arrival time base only at a 
junction point of the last source packet of TS1 and the first 
source packet of TS2 . The SPN_arrival_time_discontinuity defined 
in ClipInfo() represents an address of the discontinuous point, 
which must represent the address referencing the first source 
packet of TS2 . 

[0418] The source packet referenced by 
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RSPN_exit_f rom_previous_Clip defined in BridgeSequencelnf o ( ) may 
be any source packet in Clipl. It is unnecessary for this source 
packet to be a boundary of the Aligned unit. The source packet 
referenced by RSPN_enter_to_current_Clip defined in 
BridgeSequencelnf o ( ) may be any source packet in Clip2 . It is 
unnecessary for this source packet to be a boundary of the 
Aligned unit. 

[0419] By way of explaining limitations on Playltem, the 

0UT_time of the previous Playltem (0UT_time 1 shown in Fig. 89) 
must represent the display end time of the last video 
presentation unit of TS1. The IN_time of the current PlayTime 
(IN_time2 shown in Fig. 88 and 89) must represent the display 
start time of the first presentation unit of TS2 . 
[0420] By way of explaining the limitations on the data 
allocation in case of employing Bridge-Clip by referring to Fig. 
94, the seamless connection must be made to guarantee continuous 
data supply by the file system. This must be realized by 
arranging the Bridge-Clip AV stream, connecting to Clipl (Clip AV 
stream file) and Clip2 (Clip AV stream file) , such as to satisfy 
data allocation prescriptions. 

[0421] RSPN_exit_f rom_previous_Clip must be selected so that 
the stream portion of Clipl (Clip AV stream file) previous to 
RSPN_exit_from_previous_Clip will be arranged in a continuous 
area not less_ than half fragment. The data length of the Bridge- 
Clip AV stream must be selected so that the data will be arranged 
in the continuous area not less than half fragment. 
RSPN_enter_to_current_Clip must be selected so that the stream 
portion of Clip2 (Clip AV stream file) subsequent to 
RSPN_enter_to_current_Clip will be arranged in a continuous area 
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not less than half fragment. 

[0422] By way of explaining data allocation limitations in 

case of seamless connection not employing Bridge-Clip, by 
referring to Fig. 95, the seamless connection must be made so as 
to guarantee continuous data supply by the file system. This 
must be realized by arranging the last portion of the Clipl (Clip 
AV stream file) and the first portion of the Clip2 (Clip AV 
stream file) so that the provisions on data allocation will be 
met . 

[0423] The last stream portion of Clipl (Clip AV stream file) 
must be arranged in a continuous area not less than one half 
fragment. The first stream portion of Clip2 (Clip AV stream 
file) must be arranged in a continuous area not less than one 
half fragment. 

[0424] In a case where digital AV signals each having a 
predetermined bit rate are fragmented and recorded on a disc, to 
assure that the recorded digital AV signals can be read out from 
the recording medium 100 in the predetermined bit rate, the size 
of one continuous recording area must satisfy the following 
condition : 

S*8 / (S*8/Rud + Ts) >= Rmax 

where 

S: minimum size of one continuous recording area [Byte] 

Ts: access time of full stroke from one recording area to 
next recording area [second] 

Rud: bit rate for reading out from recording medium 
[bit/second] 

Rmax: bit rate of AV stream [bit/second] . 
[0425] That is, the data must be arranged so that S byte or 
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more data in the AV stream may be recorded continuously on the 
disc. The data must be arranged so that the size of the half 
fragment may be S byte or more. 

[0426] Next, DVR-STD is explained. This DVR-STD is a 
conceptual model for modeling the decoding processing in the 
generation and verification of the DVR MPEG 2 transport stream. 
The DVR-STD is also a conceptual model for modeling the decoding 
processing in the generation and verification of the AV stream 
referenced by two Playltems seamlessly connected to each other as 
described above. 

[0427] Fig. 96 shows a DVR-STD model. The model shown in Fig. 

96 includes, as a constituent element, a DVR MPEG 2 transport 
stream player model. The notation of n, TBn, MBn, EBn, TBsys, 
Bsys, Rxn, Rbxn, Rxsys, Dn, Dsys, On and Pn(k) is the same as 
that defined in T-STD of ISO/IEC 13818-1, wherein n is an index 
number of an elementary stream and TBn is a transport buffer of 
the elementary stream n. 

[0428] MBn is a multiplexing buffer of the elementary stream n 
and exists only for the video stream. EBn is an elementary 
stream buffer of the elementary stream n and is present only for 
the video stream. TBsys is a main buffer in a system target 
decoder for the system information for a program being decoded. 
Rxn is a transmission rate with which data is removed from TBn. 
Rbxn is a transmission rate with which the PES packet payload is 
removed from MBn and is present only for a video stream. 
[0429] Rxsys is a transmission rate with which data is removed 
from TBsys. Dn is a decoder of the elementary stream n. Dsys is 
a decoder pertinent to the system information of a program being 
decoded. On is a re-ordering buffer of the video stream n. Pn(k) 
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is a number k presentation unit of the elementary stream. 
[0430] The decoding process for DVR-STD is explained. During 
the time a sole DVR MPEG 2 transport stream is being reproduced, 
the timing of inputting the transport packet to TBI, TBn or TBsys 
is determined by arrival_time_s tamp of the source packet. The 
prescriptions for the buffering operation of TBI, MB1, EB1, TBn 
Bn, TBsys and Bsys are the same as those of the T-STD provided 
for in ISO/IEC 13818-1, while the prescriptions for the deciding 
and display operations are also the same as the T-STD provided 
for in ISO/IEC 13818-1. 

[0431] The decoding process during the time the seamlessly 
connected PlayLists are being reproduced is now explained. Here, 
the reproduction of two AV streams referenced by the seamlessly 
connected Playltems is explained. In the following explanation, 
the reproduction of TS1 and TS2, shown for example in Fig. 88, is 
explained. TS1 and TS2 are a previous stream and a current 
stream, respectively. 

[0432] Fig. 97 shows a timing chart for inputting, decoding 
and display of transport packets when transferring from a given 
AV stream (TS1) to the next AV stream seamlessly connected 
thereto (TS2) . During transfer from a preset AV stream (TS1) to 
the next AV stream seamlessly connected thereto (TS2) , the time 
axis of the arrival time base of TS2 is not the same as the time 
axis of the arrival time base of TS1 (indicated by ATC1 in Fig. 
97) . 

[0433] Moreover, the time axis of the system time base of TS2 

(indicated by STC2 in Fig. 97) is not the same as the time axis 
of the system time base of TS1 (indicated by STC1 in Fig. 97) . 
As the video display is required to be continuous seamlessly, 



however, there may be overlap in the display time of the 
presentation units. 

[0434] The input timing to DVR-STD is explained. During the 
time until time Tl, that is until the inputting of the last video 
packet to the TBI of DVR-STD, the input timing to the buffers of 
TBI, TBn or TBsys of DVR-STD is determined by arrival_time_stamp 
of the arrival time base of TS1. 

[0435] The remaining packets of TS1 must be input to buffers 
of TBn or to TBsys of DVR-STD at a bit rate of TS_recording_rate 

(TS1) . The TS_recording_rate (TS1) is the value of 

TS_recording_rate defined in ClipInfoO corresponding to Clipl. 
The time the last byte of TS1 is input to the buffer is the time 
T2 . So, during the time between time Tl and time T2, 
arrival_time_stamp of the source packet is discounted. 

[0436] If Nl is the number of bytes of the transport packet of 

TS1 next following the last video packet of TS1, the time DTI 
from time Tl until time T2 is the time necessary for Nl bytes to 
be input completely at a bit rate of TS_recording_rate (TS1) , and 
is calculated in accordance with the following equation: 
DTI = T2 - Tl = Nl/TS_recording_rate . 

[0437] During the time from time Tl until time T2 (TS1) , both 
the values of Rxn and Rxsys are changed to the value of 
TS_recording_rate (TS1) . Except for this rule, the buffering 
operation is _the same as that of T-STD. 

[0438] At time T2, the arrival time clock counter is reset to 
the value of arrival_time_stamp of the first source packet of 
TS2 . The input timing to the buffer of TBI, TBn or TBsys of DVR- 
STD is determined by arrival_time_stamp of the source packet of 
TB2 . Both Rxn and Rxsys are changed to values defined in T-STD. 
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[0439] By way of explaining additional audio buffering and 
system data buffering, the audio decoder and the system decoder 
need to have an additional buffering amount (data amount 
equivalent to one second) in addition to the buffer amount 
defined in T-STD in order to allow input data of a domain from 
time Tl to time T2 . 

[0440] By way of explaining the video presentation timing, the 
display on the video presentation unit must be continuous, that 
is devoid of gaps, through the junction point. It is noted that 
STC1 is the time axis of the system time base of TS1 (indicated 
as STC1 in Fig. 9), while STC2 is the time axis of the system 
time base of TS2 (shown at STC2 in Fig. 97; correctly, STC2 
begins at time the first PCR of TS2 has been input to the T-STD) . 

[0441] The offset between STC1 and STC2 is determined as 
follows: the PTSlend is the PTS on STC1 corresponding to the last 
video presentation unit of TS2. PTS2start is PTS on STC2 
corresponding to the first video presentation unit of TS2 and Tpp 
is the display time period of the last video presentation unit of 
TS1, the offset STC_delta between two system time bases is 
calculated in accordance with the following equation: 

STC_delta = PTSlend + Tpp - PTS2start. 
[0442] By way of explanation of the audio presentation timing, 
there may be overlap in the display timing of the audio 
presentation unit, with the overlap being less than 0 to 2 audio 
frames (see "audio overlap" shown in Fig. 97) . The indication as 
to which of audio samples is to be selected and re- 
synchronization of the display of the audio presentation unit to 
the corrected time base at back of the junction point are set on 
the player. 
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[0443] By way of explaining the system time clock of DVR-STD, 
the last audio presentation unit of TS1 is displayed at time T5 . 
The system time clock may be overlapped between time T2 and time 
T5. During this time domain, the DVR-STD switches the system 
time clocks between the value of the old time base (STC1) and the 
value of the new time base (STC2) . The value of STC2 may be 
calculated in accordance with the following equation: 

STC2 = STC1 - STC_delta. 
[0444] The buffering continuity is now explained. 
STCllvideo_end is the value of STC on the system time base STC2 
when the first byte of the first video packet reaches TBI of DVR- 
STD. STC22video_start is the value of STC on the system time 
base STC2 when the first byte of the first video packet reaches 
TBI of DVR-STD. STC2 lvideo_end is the value of STCllvideo_end 
calculated as the value on STC2 of the system time base STC2 . 
STC21video_end is calculated in accordance with the following 
equation : 

STC21video_end = STCllvideo_end - STC_delta. 
[0445] In order to obey DVR-STD, the following two conditions 
must be met: first, the arrival timing of the first video packet 
of TS2 at TBI must satisfy the following inequality: 

STC22video_start > STC2 1 video_end + ATI. 
[0446] If it is necessary to re-encode and/or multiplex the 
partial stream of Clipl and/or Clip2 in such a manner that the 
above inequality will be satisfied, this re-encoding or 
multiplexing is performed as appropriate. 

[0447] Second, the inputting of the video packet from TS1 
followed by the inputting of the video packet from TS2 on the 
time axis of the system time base, mapped from STC1 and STC2 on 
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the same time axis, must not overflow or underflow the video 
buffer . 

[0448] If the above syntax, data structure and the rules are 
used as basis, the contents of data recorded on the recording 
medium or the reproduction information can be managed properly to 
enable the user to confirm the contents of data recorded on the 
recording medium at the time of reproduction or to reproduce 
desired data extremely readily. 

[0449] In the above-described embodiment, the MPEG 2 transport 

stream is taken as an example of the multiplexed stream. This, 
however, is merely exemplary, such that the MPEG 2 program stream 
DSS or the transport stream used in the DirecTV Service (trade 
mark) of USA may also be used as the multiplexed stream. 

[0450] Fig. 98 shows a modification of BridgeSequencelnf o ( ) . 
The difference from the BridgeSequencelnf o ( ) of Fig. 38 is that 
it contains only Bridge_Clip__Inf ormation_f ile_name . 

[0451] Fig. 99 illustrates Bridge_Clip when two Playltems are 

seamlessly connected in case of using the syntax of 
BridgeSequencelnf o () of Fig. 98. RSPN_exit_f rom_previous_Clip is 
the source packet number of the source packet on the Clip AV 
stream referenced by the previous Playltem. Next to this source 
packet is connected the first source packet of the Bridge_Clip AV 
stream file. 

[0452] RSPN_enter_to_current_Clip is the number of the source 
packet on the Clip AV stream referenced by the current Playltem. 
Ahead of this source packet is connected the last source packet 
of the Bridge-Clip AV stream file referenced by the current 
Playltem. In the Bridge-Clip AV stream, shown in Fig. 99, 
SPN_ATC_start indicates the source packet number of the source 
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packet in the Bridge-Clip AV stream file at which commences the 
time axis of new arrival time base. 

[0453] The Bridge-Clip AV stream file has a sole arrival time 

base discontinuous point. The second SPN_ATC_start in the 
drawings has the same meaning as that of the 
RSPN_arrival_time_discontinuity in Fig. 37. 

[0454] If the syntax of BridgeSequencelnf o ( ) of Fig. 98 is 
used, RSPN_exit_f rom_previous_Clip and RSPN_enter_to_current_Clip 
are stored in the Clip Information file corresponding to the 
Bridge-Clip AV stream file. SPN_ATC_start is also stored in the 
Clip Information file. 

[0455] Fig. 100 shows the syntax of the Clip Information file 
when the BridgeSequencelnf o is of the syntax of Fig. 98. 
Sequencelnf o_start_address indicates the leading address of 
Sequencelnf o ( ) , in terms of the number of relative bytes from the 
leading byte of the Clip Information file as a unit. The number 
of relative bytes is counted from 0. 

[0456] Fig. 101 shows the syntax of ClipInfo() of the Clip 
Information file of Fig. 100. Clip_stream_type indicates whether 
the AV stream file of the Clip is the Clip AV stream file or the 
Bridge-Clip AV stream file. If the Clip_stream_type indicates 
the Bridge-Clip AV stream file, the next syntax field follows. 
[0457] Previous_Clip_Inf ormation_f ile_name indicates the Clip 
Information file name connected ahead of the Bridge-Clip AV 
stream file. RSPN_exit_f rom_previous_Clip is the source packet 
number of the source packet on the Clip AV stream file indicated 
by previous_Clip_Inf ormation_f ile_name . Next to this source 
packet is connected the first source packet of the Bridge-Clip AV 
stream file. This source packet number is counted as from the 
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first source packet of the Clip AV stream file, with 0 as an 
initial value. 

[0458] Current_Clip_Inf ormation_f ile_name indicates the Clip 

Information file name of the Clip connected at back of the 
Bridge-Clip AV stream file. RSPN_enter_to_current_Clip is the 
source packet number of the source packet on the Clip AV stream 
file indicated by the current_Clip_Inf ormation_f ile_name . Ahead 
of this source packet is connected the last source packet of the 
Bridge-Clip AV stream file. This source packet number is counted 
as from the first source packet of the Clip with the AV stream 
file with 0 as an initial value. 

[0459] Fig. 102 indicates the syntax of the Sequencelnf o ( ) of 
the Clip Information file of Fig. 100. Num_of_ATC_sequences 
indicates the number of the ATC_sequence in the AV stream file. 
ATC_sequence is a source packet sequence not containing 
discontinuous points of the arrival time base. In the case of 
Bridge-Clip, this value is 2. 

[0460] SPN_ATC_start [atc_id] indicates an address at which 
commences the arrival time base indicated by atc_id on the AV 
stream file. SPN_ATC_start [atc_id] is of a size based on the 
source packet number as a unit and is counted as from the first 
source packet of the AV stream file with 0 as an initial value. 
[0461] Figs. 103A and 103B illustrate database change in case 

stream data of the Clip AV stream file referenced by Bridge- 
Sequence is partially erased. It is assumed that, as indicated 
by "Before Editing" in Fig. 103A, Clipl and Clip2 are 
interconnected by Bridge-Clip, with RSPN_exit_f rom_previous_Clip 
= X and RSPN_enter_to_current_Clip = Y. 

[0462] It is assumed that a stream data portion of Zl source 
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packets of Clipl, shown shaded, and a stream data portion of Zl 
source packets of Clip2, likewise shown shaded, are to be erased. 
As a result, RSPN_exit_f rom_previous_Clip becomes equal to X - Zl 
and RSPN_enter_to_current_Clip becomes equal to Y - Z2, by way of 
value changing, as indicated by "After Editing" in Fig. 103B. 
[0463] By changing the syntax of the database pertinent to 
BridgeSequence as shown in Figs. 98 and 101, the information 
pertinent to the source packet number indicating a data address 
in the AV stream (a field commencing from RSPN in the database 
syntax) is depleted from the layer of the PlayList, such that the 
totality of the information on the source packet number is stated 
in the Clip layer. 

[0464] In this case, if it is necessary to change the value of 

the data address in the AV stream, as when data of the AV stream 
file has been partially erased, it is sufficient if merely the 
Clip Information file is supervised as data, thus advantageously 
facilitating database management. 

[0465] Fig. 104 is a flowchart illustrating the preparation of 
the Real PlayList. Reference is had to the block diagram of Fig. 
1 showing the recording and/or reproducing apparatus 1. At step 
S10, a controller 23 records a Clip AV stream. At step Sll, the 
controller 23 forms PlayList () made up of Playltems covering the 
totality of the possible playback range of the above Clip. If 
there is STC discontinuous point in the Clip, and PlayList ( ) is 
made up of two or more Playltems, the connection_condition 
between the Playltems is also determined. 

[0466] At step S12, the controller 23 forms 
UIAppInfoPlayList ( ) . At step S13, the controller 23 forms 
PlayListMark. At step 14, the controller 23 forms 
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MakersPrivateData. At step S15, the controller 23 records a Real 
PlayList file. In this manner, one Real PlayList file is created 
each time a Clip AV stream file is newly recorded. 
[0467] Fig. 105 is a flowchart for illustrating the 
formulation of a Virtual PlayList having a bridge seguence. At 
step S2 0, reproduction of one Real PlayList recorded on the disc 
is specified through a user interface. From the reproduction 
range of the Real PlayList, the reproduction domain specified by 
IN and OUT points is specified through a user interface. 
[0468] At step S21, the controller 23 verifies whether or not 

the operation of specifying the reproduction range by the user 
has been finished in its entirety. If it is verified that the 
specifying operation has been finished, the controller 23 
proceeds to step S22 and, if otherwise, the controller 23 reverts 
to step S20 to repeat the subsequent processing. 

[0469] At step S22, the user determines the connection 

condition between two continuously reproduced Playltems 

(connection_condition) , through the user interface, or the 
controller 23 determines the connection condition. At step S23, 
the controller 23 forms a bridge sequence for the seamlessly 
connected Playltems. At step S54, the controller 23 forms and 
records the Virtual PlayList file. 

[0470] Fig. 106 is a flowchart for illustrating the detailed 
processing at step S23. At step S31, the controller 23 re- 
encodes and re-multiplexes an AV stream on the OUT point side of 
the Playltem displayed on the temporally forward side. At step 
S32, the controller 23 re-encodes and re-multiplexes the AV 
stream on the IN point side of the Playltem represented next to 
the Playltem. 
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[0471] At step S33, the controller 23 determines the value of 
the RSPN_exit_f rom_previous_Clip such as to satisfy the data 
allocation condition for continuous data supply. That is, 
RSPN_exit_f rom_previous_Clip must be selected so that the stream 
portion of the Clip AV stream file previous to 
RSPN_exit_f rom_previous_Clip will be recorded in a continuous 
area not less than the aforementioned half fragment on the 
recording medium (see Figs. 91 and 94) . 

[0472] At step S34, the controller 23 determines the value of 

RSPN_enter_to_current_Clip such as to satisfy the data allocation 
condition for continuous data supply. That is, 

RSPN_enter_to_current_Clip must be selected so that the stream 
portion of the AV stream file subsequent to 
RSPN_enter_to_current_Clip will be arrayed in a continuous area 
not less than the aforementioned one fragment on the recording 
medium (see Figs. 91 and 94). 

[0473] At step S35, the controller 23 forms the Bridge-Clip AV 
stream file such as to satisfy the data allocation condition for 
continuous data supply. That is, if the volume of data prepared 
in the processing of steps S31 and S32 is less than the size of 
the aforementioned one half fragment, data is copied from the 
original Clip to prepare Bridge-Clip (see Figs. 91 and 94). 
[0474] Although the processing operations of steps S33 to S35 
is explained chronologically, these processing operations may 
also be carried out non-sequentially or simultaneously since 
these processing operations are relevant reciprocally. 
[0475] At step S36, the controller 23 forms a bridge sequence 
database. At step S37, the controller 23 records a Bridge-Clip 
AV stream file and its Clip information file. In this manner, one 
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or more Playltem is selected by the user from the reproduction 
range of Real PlayList recorded on the disc. A bridge sequence 
for enabling seamless connection of two Playltems is formed and a 
set of one or more Playltems, grouped together, is recorded as 
one Virtual PlayList. 

[0476] Fig. 107 is a flowchart for illustrating the 
reproduction of PlayList. At step S41, the controller 23 
acquires Info.dvr, a Clip Information file, a PlayList file and 
the thumbnail information to prepare a GUI picture of a list of 
PlayLists recorded on the disc and to display the so formed 
3 picture on the GUI through the user interface. 

Ft! [0477] At step S42, the controller 23 presents the information 

illustrating the PlayList on the GUI picture based on 
=7= UIAppInf oPlayList ( ) of each PlayList. At step S43, the user 

f, instructs reproduction of one PlayList from the GUI picture 

-J- through a user interface. At step S44, the controller 23 

M> acquires, from the PTS of IN_time and STC_sequence_id of the 

§** current Playltem, a source packet having an entry point 

temporally previous and closest to IN_time . 

[0478] At step S45, the controller 23 reads out data of the AV 

stream from the source packet number having the above entry point 
to send the so read out data to the decoder. If, at step S46, 
there is any Playltem temporally previous to the current 
Playltem, the controller 23 performs the process of 
interconnecting the previous Playltem and the current Playltem in 
accordance with connect ion_condition . If the Playltems are 
interconnected seamlessly, the AV stream is decoded based on the 
DVR-STD decoding method. 

[0479] At step S47, the controller 23 commands the AV decoder 
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27 to start the display as from the picture of the PTS of 
IN_time . At step S48, the controller 23 commands the AV decoder 
27 to continue decoding the AV stream. At step S49, the 
controller 23 verifies whether or not the picture currently- 
displayed is the picture of PTS of OUT_time . If it is verified 
that the picture currently displayed is not the picture of PTS of 
OUT_time, the controller 23 proceeds to step S50 to display the 
picture, after which the controller 23 reverts to step S48 to 
repeat the subsequent processing. 

[0480] If it is verified at step S49 that the picture 

currently displayed is the picture of PTS of 0UT_time, the 
controller 23 proceeds to step S51 where the controller 23 
verifies whether or not the current Playltem is the last one in 
the PlayList. If it is verified that the current Playltem is not 
the last one, the controller 23 reverts to step S44 to repeat the 
subsequent processing. If it is verified that the current 
Playltem is the last one, reproduction of the PlayList is 
finished . 

[0481] In this manner, the one PlayList file, specified to be 

reproduced by the user, is reproduced. 

[0482] Based on the above-described syntax, data structure and 
rule, as described above, the contents of data recorded on the 
recording medium, the playback information and so forth can be 
properly managed to enable the user to confirm the contents of 
data recorded on the recording medium properly to reproduce the 
desired data extremely readily. 

[0483] The above-described sequence of operations may be 
executed not only by hardware but also by software. If the 
sequence of operations is to be executed by software, it is 
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installed from a recording medium to a computer in the dedicated 
hardware of which is installed the program forming the software 
or to, e.g., a general-purpose personal computer capable of 
executing various functions based on a variety of programs 
installed therein. 

[0484] Fig. 108 shows an illustrative inner structure of a 
general-purpose computer. A CPU (central processing unit) 201 of 
the personal computer executes a variety of processing operations 
in accordance with a program stored in a ROM (read-only memory) 
202. In a RAM (random-access memory) 203, there are suitably 
stored data or programs necessary for the CPU 201 to execute 
various processing operations. To an input/output interface 205 
is connected an input unit 20 6 formed by a keyboard, a mouse and 
so forth to output signals input to the input unit 206 to the CPU 
201. To the input/output interface 205, there is also connected 
an output unit 207 made up of a display, a loudspeaker and so 
forth. 

[0485] To the input/output interface 205, there are also 
connected a recording unit 208 made up, e.g., of a hard disc and 
a communication unit 2 09 adapted for exchanging data with outer 
equipment over a network, such as the Internet. A drive 210 is 
used for reading out or writing data on or from a recording 
medium, such as a magnetic disc 221, an optical disc 222, a 
magneto-optical disc 223 or a semiconductor memory 224. 
[0486] The recording medium is constituted not only by a 

package medium distributed for furnishing the program to the 
user, in addition to a computer, such as a magnetic disc 221 
carrying the program thereon, inclusive of a floppy disc, an 
optical disc 222, inclusive of a CD-ROM (Compact Disc-Read-Only 
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memory) or a DVD (Digital Versatile Disc), a magneto-optical disc 
223, inclusive of a Mini-Disc, or a semiconductor memory 224, but 
also by a hard disc, inclusive of a ROM 202 carrying a program 
and a memory 208, furnished to the user as it is built-in in a 
computer, as shown in Fig. 108. 

[0487] In the present specification, the steps of the program 
furnished by the medium include not only the chronological 
processing in accordance with the sequence indicated, but also 
the processing performed not chronologically but in parallel or 
separately . 

[0488] Additionally, in the specification, the system means an 
entire apparatus comprised of plural component devices. 

[0489] In the information processing method and apparatus, and 
the program, according to the present invention, if the it is 
commanded to perform reproduction continuously from the first AV 
stream to the second AV stream, there is generated a third AV 
stream made up of a preset portion of the first AV stream and a 
preset portion of the second AV stream and which is reproduced 
when reproduction is switched from the first AV stream to the 
second AV stream, while there is generated, as information 
pertinent to the third AV stream, the address information made up 
of the information on an address of a source packet of the first 
AV stream at a time of switching of reproduction from the first 
AV stream to the third AV stream, and of the information on the 
address of a source packet of the second AV stream at a time of 
switching of reproduction from the third AV stream to the second 
AV stream. So, reproduction may be achieved such as to maintain 
continuity in separately recorded AV streams. 

[0490] In the information processing method and apparatus, and 
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the program, according to the present invention, a first AV 
stream, a second AV stream or a third AV stream is read out from 
a recording medium, the address information made up of the 
information on an address of a source packet of the first AV 
stream at a time of switching of reproduction from the first AV 
stream to the third AV stream, and the information on the address 
of a source packet of the second AV stream at a time of switching 
of reproduction from the third AV stream to the second AV stream, 
is read out from the recording medium as information pertinent to 
the third AV stream, and reproduction is switched from the first 
AV stream to the third AV stream and from the third AV stream to 
the second AV stream, as reproduction proceeds, based on the 
read-out information pertinent to the third AV stream. So, 
reproduction may be achieved such as to maintain continuity in 
separately recorded AV streams. 

[0491] Although the invention herein has been described with 
reference to particular embodiments, it is to be understood that 
these embodiments are merely illustrative of the principles and 
applications of the present invention. It is therefore to be 
understood that numerous modifications may be made to the 
illustrative embodiments and that other arrangements may be 
devised without departing from the spirit and scope of the 
present invention as defined by the appended claims. 
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